1,588
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Rethinking the author name ambiguity problem and beyond: The case of the Chinese context

ORCID Icon & ORCID Icon
Received 07 Dec 2023, Accepted 25 Apr 2024, Published online: 05 May 2024

ABSTRACT

The perennial problem of author name ambiguity has attracted increasing attention in the academic community. Drawing on the literature, this article first highlights the pervasiveness of the problem and discusses its adverse consequences. It then analyzes the behavioral causes of the problem in the Chinese context and attributes them to personal, cultural, and institutional factors. Informed by this analysis and recognizing ORCID as a promising solution, we propose an ORCID-based “Prevention plus Cure” campaign against author name ambiguity. The prevention objective relies on researchers’ consistent use of ORCID, while the cure objective involves retrospectively integrating ORCIDs into backfile publications. We also outline the responsibilities of various stakeholders to ensure the success of the campaign. Furthermore, we argue that universal adoption of ORCID can help curb authorship-related misconduct, discern predatory journals and publishers, and track researchers’ undesirable records of academic publishing. We then analyze the current status of ORCID adoption in China, identify potential challenges, propose tentative solutions to address them, and highlight ORCID as a tool that can be utilized to empower China’s combat against research misconduct. In conclusion, we emphasize the importance of conducting empirical research to inform more effective promotion of ORCID adoption in China.

Introduction

The author name ambiguity problem is classified by Shin et al. (Citation2014) into two categories: “the namesake problem,” which pertains to identical names, and “the heteronymous name problem” (15), which relates to the use of different names by the same researcher. Identified with not only Chinese researchers but also their international colleagues with names in many other languages (Castillo Citation2009; Strotmann and Zhao Citation2012), the author name ambiguity problem has been attracting steady scholarly attention since the advent of digitalized academic publishing in the mid-1990s (Sanyal, Bhowmick, and Das Citation2021; Smalheiser and Torvik Citation2009). The globalization of science communication, the digitalization of academic publishing, and the need for efficient and accurate author-centric retrieval of bibliographic data necessitates author name disambiguation in the context of all languages. On the one hand, the large number of Chinese researchers have contributed considerably to enriching the scientific literature (Adams et al. Citation2023; To and Yu Citation2020; Woolston Citation2023). On the other hand, the author name ambiguity problem with Chinese researchers has posed a serious challenge to effective and efficient author-centric retrieval of bibliographic information (Harzing Citation2015; Sun and Zhou Citation2002; Tang and Walsh Citation2010). Although considerable progress has been made in trying to solve the problem over the past two decades, we are not there yet (Sanyal, Bhowmick, and Das Citation2021). Against this backdrop, it is now high time to rethink the author name ambiguity problem in the Chinese context and beyond. This article aims to heighten the awareness, in not only Chinese researchers but also the rest of the science community, about the gravity of the author name ambiguity problem and the importance of their joint efforts to solve the problem.

The pervasiveness of the problem and its adverse consequences

The author name ambiguity problem is wittingly summarized by Granshaw (Citation2019) as “the issue with Wang and Smith” (236) because Wang is the most popular surname in China and Smith in five English-speaking countries (i.e., Australia, Canada, New Zealand, the UK, and the USA). Although Smith is a common English surname, it is “only the 123rd most common surname in the world” (Granshaw Citation2019, 239), outnumbered by quite a few Chinese surnames. As estimated by China’s Ministry of Public Security, round 85% of China’s population (i.e., more than 1.1 billion people) shared 129 surnames (Qiu Citation2008). Drawing on the Forebears database of names,Footnote1 Granshaw (Citation2019) found that there were 108, 105, 98, 75, 74 millions of Chinese surnamed Wang, Li, Zhang, Chen, and Liu, respectively and that approximately 40% of Vietnamese population were surnamed Nguyen. Similarly, nearly half of Korea’s population were surnamed Kim, Lee, and Park (Harzing Citation2015). The phenomenon of identical surnames for large proportions of the national populations in these countries suggests that the author name ambiguity problem is likely to be highly pervasive among academics from the countries.

In 2011, Y. Wang was identified as the most productive researcher name, which had appeared in the bylines of 3,926 academic publications, with an incredible publication rate of almost 11 papers per day (Butler Citation2012). Similarly, a search in the Web of Science revealed that Wang, Y had authored 1,116 publications in nanotechnology between 2000 and 2007 (Tang and Walsh Citation2010). Harzing’s (Citation2015) investigation uncovered more myths about Y Wang’s research productivity. Specifically, Y Wang had managed to author around 30,000 articles between 1 January 2005 and 28 February 2015, with a publication rate of almost 9 per day. Y Wang had even published more than 100 papers in each of 73 distinct research areas and was affiliated with more than 500 universities in nearly 100 different countries. Further examination revealed that the initial Y referred to 6 different Pinyin transliterations of Chinese given names (i.e., Yang, Yi, Yuan, Ying, Yan, and Yu) in the last 10 papers by Y Wang. To zoom in on Yang Wang, the name authored 2,500 papers, with a weekly publication rate of 5, and was affiliated to more than 100 universities and even many different appointments within the same university. Similarly, Qiu’s (Citation2008) search for Wang X in PubMed returned 8,904 entries, and Torvik and Smalheiser (Citation2009) came to realize that approximately two thirds of author names in PubMed were ambiguous. A search on the ORCID websiteFootnote2 on 7 December 2023 showed that there were 281,819 instances of Wang, including 2, 863 instances of Wang Wei, and 18,427 occurrences of Smith, including 192 occurrences of John Smith.

The author name ambiguity problem by nature makes it extremely challenging, if not impossible, to conduct efficient author-centric retrieval of bibliographic data. This adverse consequence affects various stakeholders who need a complete and accurate research profile of individual researchers for different purposes (see Smalheiser and Torvik Citation2009). Without efficient and accurate identification of target researchers, it is impossible to conduct reliable assessment of research productivity and impact of individual researchers (Harzing Citation2015; Strotmann and Zhao Citation2012). It is also time-consuming for researchers to identify potential collaborators whose full or abbreviated names happen to be shared with too many peer researchers. Similarly, it is a time-consuming process involving algorithms and manual data curation to compile accurate lists of Highly Cited Researchers and the World’s Top 2% Scientists. Careless research institutions may end up being deceived by researchers who fake their research productivity with a publication list padded with their namesakes’ publications (see a case of Tsinghua University as revealed by Qiu Citation2008). Journals and research funding agencies may be misinformed by a stellar list of publications contributed by multiple namesakes and consequently end up mis-identifying reviewers for manuscripts and grant proposals. For instance, the second author of this article, whose expertise lies in applied linguistics, was once misidentified as a namesake scholar and invited by a European national fund administration to review a grant proposal on e-government.

The causes of author name ambiguity in the Chinese context

Before examining the causes of the author name ambiguity problem in the Chinese context, it is necessary to present some facts about Chinese personal names. First, a Chinese personal name appearing in Chinese characters is made up of a surname and a given name, with the former being followed by the latter. Second, the majority of Chinese personal names consist of a one-character (monosyllabic) surname and a two-character (disyllabic) given name (Du Citation2007). Third, according to a recent national survey conducted by the Household Management Research Center of the Ministry of Public Security (HMRCoMPS, Citation2021), only 100 or so of approximately 1,600 Chinese surnames are frequently used, with the most popular five combined (i.e., Wang [王], Li [李], Zhang [张], Liu [刘], and Chen [陈]) accounting for 30.8% of the registered national population of China. Fourth, only 60 disyllabic surnames were included in The Hundred Surnames, and fewer than 10 remain in use (Dai Citation2006). The Chinese government encourages the naming of newborns with a disyllabic surname (double-barreled surname) composed of their parents’ two monosyllabic surnames to increase unique names (Du Citation2007). Fifth, although a large number of Chinese characters can be used as given names, a limited number of them are preferred, with differences in gender and changes over time (HMRCoMPS, Citation2021). Sixth, the Hanyu Pinyin (hereafter referred to as Pinyin) system was officially introduced by the State Council of China (Citation1978) to transliterate Chinese personal and place names and has been adopted by the International Organization for Standardization (ISO) since Citation1982. Seventh, the tone-marking diacritics are omitted from Pinyin transliterations of Chinese personal names (State Council of China, Citation1978).

The author name ambiguity problem is manifested with four types of namesake in the context of the Chinese language. The predominant use of a limited number of Chinese surnames (Du Citation2007; He Citation2003) and nation-wide preferences for a small number of Chinese characters as given names (HMRCoMPS, Citation2021) result in a huge number of personal names identical both in Chinese characters and in Pinyin, which are identified as genuine namesakes in this article. As Mandarin Chinese is rich in homophones (Kałużyńska Citation2015; Liu, Shu, and Li Citation2007; ISO Citation2015), many Chinese personal names are homophonic namesakes, which are different in Chinese characters but identical in pronunciation and thus in Pinyin. For instance, 6 different surnames in Chinese characters (i.e., 计纪季蓟暨冀) are all pronounced as (Dai Citation2006), and at least 9 different Chinese characters that may be picked as a given name (i.e., 韦伟玮炜纬苇帷闱嵬) are all pronounced as Wěi. The practice of not including tone markers in Pinyin transliterations of Chinese personal names can lead to homographic namesakes, whose corresponding names in Chinese characters are different. For instance, Hu Guangwei is a shared Pinyin transliteration of many possible different Chinese personal names if their tone markers are removed, such as Hú Guāngwěi, Hú Guǎngwéi, Hù Guāngwěi, and Hù Guǎngwèi, each of which may refer to many different personal names in Chinese characters. Initial-based namesakes may occur when Pinyin transliterations of personal names are presented as a combination of a surname and initials of a given name. For instance, Hu G. may refer to Pinyin transliterations of many different personal names, such as Hu Guang, Hu Guangwei, Hu Gong, Hu Gongxun, Hu Ge, and Hu Gehuan, each of which may refer to many different personal names in Chinese characters.

The ambiguity problem with Pinyin transliterations of Chinese personal names can be further complicated by the ordering of given names and surnames, the number of their syllabi, and the (dis)connection of disyllabic given names and surnames. Despite the adoption of the surname-first order (State Council of China, Citation1978), the surname-last order has been usually adopted proactively or passively when Chinese researchers publish in English-medium venues. Changing the order of the monosyllabic given name and the monosyllabic surname may make little, if any, difference to the possibility of a personal name having namesakes, as illustrated by Qiu (Citation2008) with the examples of Wei Yang (魏洋) and Yang Wei (杨卫). In cases of a combination of a monosyllabic surname and a disyllabic given name, the two syllabi of the given name may be either hyphenated or separated. Separating a given name’s two syllabi may lead to the second syllable being misidentified as a monosyllabic surname. For instance, Xu Shaoxiong, when presented as Xu Shao Xiong, may be misidentified as Xiong Shao Xu and/or Xiong Shaoxu, which may refer to many different personal names in Chinese characters. Similarly, the hyphenated two syllabi of a given name may end up being misidentified as a disyllabic surname. For example, Wan Ling-Hu may be misidentified as Ling-Hu Wan, with the surname in the former being Wan (万) and in the latter Ling-Hu (令狐). Considering that there are just a few dozens of traditional disyllabic Chinese surnames (Qiu Citation2008), the possibility of mixing up disyllabic surnames with disyllabic given names is probably low for Chinese-knowing readers.

It should be noted that there are institutional inconsistencies in hyphenating non-monosyllabic given names and capitalizing surnames in Pinyin. The Pinyin system allows the capitalization of the first letter of surnames but disallows the hyphenation of polysyllabic given names (State Council of China, Citation1978). In contrast, the data norm for retrieval and evaluation of Chinese academic journals (hereafter referred to as the CAJ data norm) issued by the Editorial Committee of Chinese Academic Journals (ECCAJ Citation1999), as well as its revised version (ECCAJ Citation2006), mandates full capitalization of surnames and hyphenation of non-monosyllabic given names and double-barreled surnames. Such a requirement of hyphenation is not always followed in practice. Notably, the CAJ data norm is only applicable to English translations of publication by-lines in Chinese academic journals. As a result, Chinese researchers who have published in both Chinese and English journals may not hyphenate their non-monosyllabic given names in Pinyin when publishing in English-medium international journals.

Although the surname-first order is officially endorsed and promoted by ISO as an international standard for Chinese personal names in Pinyin, Chinese researchers have to apply the surname-last order most of the time when publishing in international outlets. As English has been a lingua franca for modern science communication (Mauranen Citation2003; Tardy Citation2004), most internationally well-recognized academic journals and bibliographic databases, which are owned and operated by publishers in English-speaking countries, follow the Anglo convention of placing surnames after given names and allowing the use of initials for given names. Additionally, bibliographic databases may index an additional set of author names in the form of surnames plus initials of given names even though given names are fully spelled in publication by-lines. Chinese researchers with educational and research experiences in Western countries tend to anglicize their surname position. Notably, it is a common practice within the Chinese communities in Hong Kong and Singapore that an anglicized given name precedes a transliterated Chinese name following the surname-first order (e.g., Tony Leung Chiu-wai and Lawrence Wong Shyun Tsai).

Since researchers do not have to use their legal names for science communication, some Chinese researchers with educational and/or research experiences in Western countries may adopt a Western given name and place it in front of their Chinese name following the surname-last order to form a hybrid new name for academic purposes (e.g., Andy Xuesong Gao). Some researchers may parenthesize their Western given name and place it between their Chinese given name and surname. For example, the first author of this article used to appear as Shaoxiong (Brian) Xu in his first six publications, in which the parenthesized English name was intended to better distinguish him. Unexpectedly, his hybrid name tends to appear on reference lists, without the initial of his Western given name, very likely because of the parentheses. As a result, he decided to remove the unhelpful parentheses and use the new hybrid name (i.e., Shaoxiong Brian Xu) throughout the rest of his academic career. Such a decision is supported by Harzing’s (Citation2015) finding that researchers cited with more initials are less likely to be misidentified. A well-established scholar in the field of applied linguistics has also adopted more than one hybrid name, publishing as both Andy Xuesong Gao (e.g., Zheng and Gao Citation2016) and Xuesong (Andy) Gao (e.g., Gao and Zheng Citation2018). The examples of Andy and Brian clearly indicate that researchers’ inconsistence in using their names for scholarly publication is a cause of the author name ambiguity problem.

Apart from individual researchers’ personal choice, culture can also play a role in naming practices. Researchers may change their surnames to follow a local culture or tradition. For instance, married women in Hong Kong may place their husbands’ surname in front of their maiden name to form a two-character new surname. In the case of Lam Cheng Yuet-ngor, the former Chief Executive of Hong Kong, Lam is her husband’s surname and Cheng her maiden surname. Some female researchers married to Westerners may choose to drop their original Chinese surname and adopt their spouses’ Western surname. We know at least one example of this kind but refrain from sharing it for the sake of privacy. A researcher of any sex in a relationship may adopt their partners’ surname, permanently or temporally, but retain or discard their original surname.

Researchers may adopt different transliteration systems (e.g., Pinyin and Wade-Giles) to Romanize their personal names, depending on the culture which they were born into and/or brought up in. While the adoption of the Pinyin system in transliterating personal names has been mandated in China since 1978 (State Council of China), the Wade-Giles system and its variants are adopted within the Chinese community outside of China, where languages other than Mandarin Chinese, such as Cantonese, Hakka, Hokkien, Teochew and Hainanese, are spoken. Researchers in China who published internationally in English both prior to and after the official introduction to the Pinyin system may have published under different names due to the adoption of different transliteration systems. Last but not least, even if a comprehensive set of regulations on presenting personal names in publication by-lines were introduced and universally adopted for international science communication, honest error could be made by journals/publishers, bibliographic databases, and even researchers themselves in handling author names (Gasparyan et al. Citation2016). To conclude, the reasons for the author ambiguity problem can be personal, cultural, and institutional, as summarized in .

Table 1. Reasons for changing Pinyin transliterations of Chinese personal names.

ORCID as a promising solution to author name ambiguity

Since Bagga and Baldwin’s (Citation1998) call for author name disambiguation two and half decades ago, a plethora of algorithm-based techniques, which rely on various types of auxiliary information (e.g., affiliations, e-mail accounts, address, coauthors, and disciplines), have been proposed to tackle the author name ambiguity problem (for a comprehensive review, see Sanyal, Bhowmick, and Das Citation2021). However, without manual data curation, none of them can ensure complete data accuracy due to the lack or incompleteness of auxiliary information (Shin et al. Citation2014; Smalheiser and Torvik Citation2009) and the multi-authorship, multi-disciplinary and multi-institutional nature of academic publications (Smalheiser and Torvik Citation2009). In response, some bibliographic databases, such as Scopus and Web of Science, have established their own author identification system. However, the usefulness of those author identification systems is limited due to their database-specific applicability. Accordingly, there was an urgent need for a universal author identification system.

In 2009, the Open Researcher Contributor Identification (ORCID) Initiative was introduced as a long-desired ultimate solution to the challenging author name ambiguity problem (Haak et al. Citation2012). Three years later, ORCID registration was made available to individual researchers free of charge, and the yearly active ORCID researchers amounted to 8.09 million at the time of writing this article (ORCID Citationn.d.-b). Given its increasing popularity, the ORCID solution is envisioned to solve the author name ambiguity problem. However, recent investigations identified “systematic variations in adoption and integration of ORCID into publication metadata” (Youtie et al. Citation2017, 437) and a relatively low frequency of ORCID use in publications archived by PubMed and MEDLINE (Boudry Citation2021). In particular, researchers from the USA and Asian countries, especially China, have adopted ORCID less frequently than their European counterparts have (Youtie et al. Citation2017). Therefore, we argue that it is now high time to launch an ORCID-based “Prevention plus Cure” campaign against the author ambiguity problem. While realization of the objective of prevention mandates every author’s consistent use of their ORCID, the objective of cure can be achieved by retrospectively integrating ORCID to publications, following the same logic as assigning DOIs to backfile publications.

The ORCID-based “Prevention plus Cure” campaign requires joint efforts and proactive participation of various stakeholders, as summarized in . Notably, there are four prerequisites for its success. First, every researcher should register for one ORCID account (preferably in one’s full name) for science communication and use it throughout their academic career. Second, journals/publishers should make it procedurally compulsory and technically convenient for authors to disclose ORCIDs in their own publications, as exemplified by Journal of Material Science (Carter and Blanford Citation2017) and many other journals. Third, the algorithm-based author name disambiguation techniques that are currently in use should be optimized to minimize data inaccuracy. Fourth, manual curation of author-related data and authoritative correction to inaccurate data should serve as a supplement to adopted algorithm-based author name disambiguation techniques. In addition to satisfying the above four prerequisites, stakeholders are also expected to do more as outlined below.

Table 2. Institutional stakeholder duties for the ORCID-based “Prevention plus Cure” campaign.

Researchers should be consistent in using their name, be it a legal one or a pen name, for scholarly communication throughout their academic career. Those who have published in different names should stick to the current one and refrain from making further changes. Researchers with an ORCID should publicize on their ORCID webpages a full record of their own publications. By so doing, they can not only distinguish themselves from their namesakes but also provide reliable data for developing better algorithm-based author name disambiguation techniques. For the same purposes, researchers should always provide in their publications sufficient and accurate author-related information (i.e., author names, ORCID accounts, affiliations, e-mail accounts, and addresses) (Huang et al. Citation2014; Morillo, Santabárbara, and Aparicio Citation2013). Researchers are expected to recommend consistent use of ORCID to their coauthors, colleagues, and research students. They are also encouraged to report to journals/publishers and bibliographic databases any inaccuracy in author-related information. New entrants to the academic publishing system (e.g., research students and beginning researchers) should be made aware of the importance of a career-wide name for scholarly communication via academic supervision and/or institutional training. They are also advised to apply for an ORCID before submitting their maiden work for publication and to keep their ORCID accounts for subsequent publications.

Journals/publishers should consider modifying their regulations on author name disclosure. Specifically, they can mandate disclosure of full names of cited authors on reference lists. Making such a change is justified by the facts that full names are more immune than those with initials to name ambiguity (Harzing Citation2015) and that electronic academic publishing is free from the constraint of publishing space. Journals/publishers can also advocate use of tone-marking diacritics of author names transliterated from a language characterized with tone variations. Additionally, journals/publishers can consider allowing authors to provide in by-lines both their transliterated names and those in their mother tongues, with the latter being parenthesized and following the former, as American Astronomical Society (Citationn.d.) journals do. Since these three suggested changes can only reduce the possibility of author name ambiguity, it is more desirable that journals/publishers focus on promoting the use of ORCID as a promising solution to the author name ambiguity problem. To this end, journals/publishers should make it mandatory for researchers to use ORCID in accepted manuscripts. Moreover, they should develop and follow a workflow that enables retrospective integration of ORCIDs into extant publications without such identifiers. Ideally, such a workflow covers all possible scenarios of ORCID absence (and maybe inconsistence). For instance, if a deceased author did not apply ORCID to some of his/her publications, the same ORCID should be integrated into all those publications posthumously.

Bibliographic databases, such as Web of Science, Scopus, and PubMed, should strictly apply the principle of modification-free data coverage in providing author-related information as part of metadata of all their indexed publications. In other words, they should avoid replacing authors’ full given names with initials and update ORCID integrated into publications retrospectively. The ORCID organization is expected to consider assigning an ORCID to every deceased researcher who did not have one, and then those posthumously assigned ORCID can be shared with and integrated retrospectively by journals/publishers into all the publications of deceased researchers. To endorse and promote the ORCID-based solution, other stakeholders (e.g., research institutions, research funding agencies, and conference organizers) can make it an institutional requirement for researchers to submit an ORCID-generated researcher profile. Last but not least, it is desirable that a consortium of various stakeholders (i.e., journals/publishers, bibliographic databases, research-performing organizations, funding agencies, and the ORCID organization) can be established to share author name disambiguation algorithms and data as part of orchestrated efforts to conduct the ORCID-based “Prevention plus Cure” campaign.

ORCID as a tool against misconduct in academic publishing

Our above discussion on the author name ambiguity problem was based on the assumption that every researcher publishes ethically and responsibly. Unfortunately, such an assumption is challenged by the pervasive phenomenon of ghost authorship and gift authorship (Flanagin et al. Citation1998). These two types of unethical authorship involve people taking up a position in a by-line despite a lack of (substantial) contribution to the published research or the process of writing it up. An alarming case of ghost authorship, as exposed by Perron, Hiltz-Perron, and Victor (Citation2021), is that the publication is but a paper mill product derived from no research work at all. An exceptional scenario of gift authorship is imposed gift authorship, in which the authorship receiver remains unaware of the existence of the authorship giver and of being listed as a coauthor until the publication ends up being retracted due to misconduct (Marcus Citation2021). The worst scenario possible is a mixture of ghost authorship and imposed gift authorship, where an innocent researcher is unknowingly listed by an ill-intended researcher as a coauthor of an AI-generated paper mill article which ends up being retracted for misconduct, as revealed by Oransky (Citation2021). Apparently, both ghost and gift authorship inflate researcher visibility through undeserved publications, intentionally or unintentionally, which can exacerbate the author name ambiguity problem when ghost and gift authorship receivers have any of the four types of namesakes mentioned earlier.

Researchers who have any of the four types of namesakes in academia may reap two windfall-like benefits at the cost of their namesake researchers. They can not only enlarge their oeuvre by amalgamating their namesakes’ publications with their own (Harzing Citation2015) but also impute their own undesirable publishing history (e.g., corrections, expressions of concern, and retractions) to their namesakes. These two problems can be effectively addressed through the universal adoption of ORCID in academic publishing. Disappointingly, over a decade after the launch of the ORCID initiative, it is still common that journals/publishers make it optional for corresponding authors to provide their ORCIDs at the stage of submission and add their coauthors’ only after article acceptance. If ORCID disclosure of all coauthors of a manuscript at the submission stage is made mandatory by journals/publishers, all the coauthors will be notified through their ORCID-linked e-mail account to confirm their authorship of the submitted manuscript and their consent to the submission. Such a workflow can not only reduce the risk of imposed gift authorship but also help to distinguish legitimate journals/publishers from predatory ones.

As it takes resources to ensure that every author in a publication by-line is attached to a unique ORCID, predatory journals/publishers, which are inherently profit-prioritizing, may ignore to make such an investment. If they happen to adopt ORCID, it will make their clients’ record of publishing with predatory journals/publishers easily retrievable, which may discourage their potential clients. Therefore, ignorance of ORCID can be seen as an indicator of predatoriness of journals/publishers. Fortunately, universal ORCID adoption will make it possible to systematically track individual researchers’ undesirable history of academic publishing. Specifically, it is envisioned that an ORCID-based author search in the Retraction Watch Database in the future will be returned with a full list of a specific researcher’s corrections, expressions of concern, and retractions, if any. To make better use of ORCID as an effective deterrent, if not a weapon, against various unethical publishing practices (e.g., paper mill publications, imposed gift authorship, and predatory publishing), research institutions and research funding agencies can consider requiring researchers to submit ORCID-adopting publications for research assessment (Haak, Meadows, and Brown Citation2018) and grant application (Notay Citation2018), as argued earlier on.

ORCID adoption in China

In this section, we first examine the current state of ORCID adoption in China, providing an overview of its integration within the Chinese science community. We then highlight the primary challenges that could hinder the effective implementation of ORCID in China. Following this, we offer a set of recommended solutions to address these challenges, aiming to enhance the adoption and utility of ORCID. We conclude by exploring how ORCID can be leveraged to strengthen China’s efforts to combat research misconduct, the main cause of a rocketing number of publication retractions.

The current state

According to our literature search on the China National Knowledge Infrastructure (CNKI), the ORCID initiative was first introduced to the Chinese science community in 2010. In 2014, the National Science Library of the Chinese Academy of Sciences launched the iAuthor platform,Footnote3 providing a convenient way for Chinese researchers to register for an ORCID account (Haak Citation2014). However, at the time of writing this article, only 69,670 researchers have registered for an ORCID account via the iAuthor platform. According to the ORCID statistics as of late January 2024, the number of Chinese researchers with active ORCID accounts was approximately 388,973, accounting for approximately 4.82% of the total of 8.07 million ORCID account holders worldwide (ORCID Citationn.d.-b). These statistics indicate that ORCID has not been widely adopted in China,Footnote4 despite calls from many Chinese scholars for its adoption (Bai Citation2015; Huang, Yu, and Zhang Citation2017; Li, Xu, and Shi Citation2016; Ma and Huang Citation2016; Sun et al. Citation2015; Wang, Huang, and Liu Citation2015).

Notably, out of 1,356 ORCID member organizations from 59 countries, only 6 were based in China, accounting for less than 0.45% (ORCID, Citationn.d.-a).Footnote5 This inactive involvement of Chinese organizations may have contributed to the low level of ORCID adoption among Chinese researchers. Notably, data obtained in early October 2020 revealed that approximately 177,700 researchers from 42 universities on the Double First-Class Construction scheme (hereafter referred to as Double First-Class universities) had registered for an ORCID account (Dou, Zhang, and Zhang Citation2022). This number accounted for over 45% of the active ORCID accounts as of late January 2024, indicating a significant representation of Double First-Class University researchers within the Chinese community of ORCID users.

Possible challenges

The participation of various stakeholders, such as journals/publishers, bibliographic databases, funders, research-performing organizations, and individual Chinese researchers is crucial for promoting ORCID adoption in the Chinese science community. However, not all manuscript handling systems adopted in China have integrated ORCID. It was reported that Chinese academic journals are required to pay for activation of the ORCID functionality embedded into their manuscript handling systems (Pan et al. Citation2020). This poses a challenge for ORCID adoption, as the majority of Chinese academic journals are operated independently by universities with limited financial resources and, therefore, may not be motivated to adopt ORCID. Furthermore, even among the Chinese academic journals that have taken the lead in adopting ORCID, many do not provide direct access to authors’ ORCID webpages, requiring readers to manually enter a 16-digit number into the ORCID website. This inconvenience not only hinders the visibility of ORCID but also underrepresent its utility. As a result of the low ORCID adoption by Chinese academic journals, none of the major Chinese bibliographic/citation databases, such as CNKI, Wanfang Data, and VIP, has enabled author search via ORCID, in contrast to their international counterparts like Scopus and Web of Science.

Since the ORCID record schema can provide a common information core required by research funders worldwide (Brown and Demeranville Citation2018), it is expected that Chinese research funders will eventually join their international counterparts (Notay Citation2018) in mandating ORCID adoption by grant applicants. However, as of now, no Chinese research funder, at any level, has become an ORCID member organization, and there is no indication that any Chinese research funder requires applicants to disclose their ORCID accounts. This lack of endorsement from their research funders may discourage some Chinese researchers from registering for an ORCID account. Despite the active investment of prominent Chinese research-performing organizations, such as Tsinghua UniversityFootnote6 and Lanzhou University,Footnote7 in their development of institutional repositories, the adoption rate of ORCID within these repositories remains low. This is evident from the limited disclosure of ORCID accounts on individual researchers’ personal webpages. Notably, the Confederation of China Academic Institutional RepositoryFootnote8 does not allow ORCID-based author search on its database. This lack of ORCID adoption by Chinese academic institutional repositories can be attributed to ORCID’s narrow focus on research-related activities, which does not cater to the broader administrative needs of Chinese research-performing organizations. One area where ORCID falls short is in facilitating the administration of data on teaching performance, which is a crucial component of Chinese universities’ annual Key Performance Indicator exercises. Unfortunately, ORCID does not currently support teaching performance metrics. Their expectations of the all-in-one functionality of ORCID may hinder the adoption of ORCID among Chinese research-performing organizations.

Some Chinese researchers submitting manuscripts to Chinese academic journals may be unaware of the importance of ORCID, as well as reluctant to share their published research works (Pan et al. Citation2020; Wei Citation2015). Many other Chinese researchers may have already registered for an ORCID account and have to use it on international publishing platforms, where ORCID is often mandated for corresponding authors (ORCID, Citation2015). This could explain why Chinese researchers who primarily publish in Chinese academic journals may not actively use their ORCID accounts. They may not see the value in providing a complete and regularly updated record of their Chinese publications on the ORCID platform, especially if they have to manually upload them. This inconvenience may lead to their perception of the limited usefulness of ORCID. Furthermore, unlike publication records automatically synced from journals/publishers, manually added Chinese publications do not have institutional endorsement, which may reinforce Chinese researchers’ misperception of the limited usefulness of ORCID. Additionally, unlike the iAuthor platform, ORCID does not allow researchers to include a profile photo, making it impossible to generate a curriculum vitae with a profile photo directly from ORCID. This lack of a secondary functionality may discourage some researchers from fully embracing ORCID. The incomplete, inaccurate, and untimely nature of the data provided on many Chinese researchers’ ORCID webpages (Pan et al. Citation2020) may also have discouraged Chinese institutions from utilizing ORCID.

Security concerns have been identified in the Chinese literature as a major barrier to ORCID adoption in the Chinese science community, especially in regard to Chinese national security in science and technology. Some Chinese scholars (Dou, Zhang, and Zhang Citation2022) have raised concerns about the potential threat to the privacy of talents worldwide, as ORCID-based profiling of talents could enable precise monitoring and targeting of individual researchers. Informed by consultations with Chinese security experts, Chinese Journal of Cancer Research publicly announced its decision not to require ORCID disclosure from its authors (Dai Citation2021). The decision was made based on four main concerns: serious flaws in the identification of researchers, harm to academic freedom and the creation of a new monopoly in academic publishing, the potential threat to privacy from collecting and selling ORCID data, and the risk of arbitrary management of ORCID data (Dai Citation2021). In order to address these concerns and protect Chinese national security in science and technology, it is suggested that Chinese researchers should be encouraged to publish their research in domestic academic journals, and a nation-level system for managing the identities of Chinese researchers be established through legislation (Dai Citation2021).

Fear of potential negative consequences may deter some Chinese researchers from fully adopting ORCID. Complete adoption of ORCID would expose researchers’ undesirable publishing histories, including corrections, expressions of concern, or retractions. Having their full publication records displayed on their ORCID webpages might lead to reputational damage (Xu and Hu Citation2022). Furthermore, full adoption of ORCID would facilitate the efficient retrieval of individual researchers’ retracted publications, potentially resulting in various tangible sanctions, such as exclusion from research funding applications (Tang, Wang, and Hu Citation2023). To avoid these adverse effects, some researchers might intentionally create and use multiple ORCID accounts for different aspects of their science communication, thereby undermining ORCID’s primary function of linking individual researchers with all their research outputs.

Suggested solutions

Promoting the adoption of ORCID among Chinese researchers can be achieved by effectively addressing the security concerns of the Chinese science community, offering ORCID functionalities that cater to a wider range of needs of Chinese researchers and institutions, and highlighting successful cases of ORCID adoption in China. To ensure the data security of Chinese ORCID users, ORCID could explore the possibility of establishing a dedicated data server within China, similar to Apple’s establishment of a data center in Guizhou exclusively for storing the data of Chinese Apple users. This localized data storage approach could enhance trust and compliance with Chinese regulations while safeguarding the privacy and security of Chinese researchers’ data on the ORCID platform. Furthermore, it is recommended that ORCID further expand its utility by encompassing all academic-related data, such as administrative services, teaching, supervision, and social outreach. Ideally, the functionality and data coverage of ORCID should enable the automatic generation of comprehensive CVs for academics. This would provide a more holistic view of an individual’s academic contributions and facilitate streamlined documentation and presentation of their academic achievements.

Conducting case studies can provide valuable insights into the adoption of ORCID in select journals included in the Excellence Action Plan for China’s STM Journals. This plan, initiated by the China Association for Science and Technology with the aim of internationalizing Chinese STM journals, provides a strategic opportunity for promoting ORCID adoption. Additionally, conducting case studies on Chinese journals that have taken the lead in adopting ORCID, such as New Technology of Library and Information Service, Chinese Journal of Scientific and Technical Periodicals, Journal of Library Science in China, and Journal of Zhejiang University (Humanities and Social Sciences), can serve as compelling examples to showcase successful ORCID adoption. Furthermore, showcasing the adoption of ORCID in institutional repositories of some of the current 147 universities on the Double First-Class Scheme, such as Tsinghua University and Lanzhou University, can serve as compelling models in China. These universities, driven by their aspiration to improve global university rankings, may prioritize publishing in international journals over domestic ones. As a result, they are likely to demonstrate a higher motivation for adopting ORCID.

The Hong Kong Principles for assessing researchers highlights the importance of recognizing ORCID adoption as a commendable contribution to the open science movement (Moher et al. Citation2020). Individual researchers who are actively involved in academic publishing should take the lead in embracing ORCID and fostering a culture of ORCID-facilitated open science. Without such a culture, the Prevention objective of the “Prevention plus Cure” campaign cannot be achieved, thereby making the attainment of the Cure objective even more challenging. The establishment of a culture of ORCID-facilitated open science heavily relies on the commitment of individual researchers to honesty, accountability, and transparency in their academic activities.

It is important for ORCID to develop technical solutions that minimize the occurrence of duplicate ORCID accounts and maximize the visibility of full publication records for ORCID users. By implementing measures that deter individuals from gaming the academic publishing system with multiple ORCID accounts, ORCID can uphold the system’s integrity and reliability, thereby encouraging more institutions to adopt ORCID. In addition to technical solutions, it is vital to educate the science community about the significance of disclosing undesirable publishing histories. Instead of being viewed in a negative light, these disclosures should be acknowledged by the science community as compelling evidence of researchers’ dedication to honesty, accountability, and transparency after the fact. Transparently acknowledging and addressing mistakes or issues in the publishing process contributes to the overall integrity of scientific research and promotes responsible scholarly communication. As a result, researchers who proactively disclose their undesirable publishing histories can gain positive recognition from the science community, as advocated by Benoit’s (Citation2015) image repair theory.

ORCID-empowered fight against research misconduct

It is recommended that ORCID establish records for researchers’ breaches of research integrity (Chen Citation2016), which can be utilized by research administrators in identifying cases of research misconduct and subsequently imposing sanctions. In other words, the ORCID adoption among researchers with a poor record of research integrity may result in not only reputational damage but also concrete consequences. For instance, in August 2022, the Chinese Ministry of Science and Technology, along with 21 other national governing bodies, introduced a comprehensive set of regulations on investigating and sanctioning violations of research integrity.Footnote9 These regulations aim to serve as a strong deterrent against research misconduct among Chinese researchers. However, for these regulations to be effective, it is essential to precisely and efficiently identify researchers with a record of publications retracted due to research misconduct. This is where the connectivity provided by ORCID, which links researchers to their complete publication records, can play a significant role.

A global success of the ORCID-based “Prevention plus Cure” campaign can facilitate the identification of researchers with a record of retractions due to research misconduct, making the process efficient and, ideally, effortless. Given that China has a larger number of retracted publications and a higher retraction rate compared to other countries (van Noorden Citation2023), it can benefit more from this initiative. To strengthen research integrity and combat research misconduct, China must impose appropriate sanctions on researchers with a record of retractions due to research misconduct. Universal ORCID adoption can enable efficient identification of repeat offenders with multiple retractions while avoiding the inadvertent penalization of their namesakes without a retraction record or with fewer retractions. By leveraging the ORCID capabilities, the “Prevention plus Cure” campaign can empower China’s fight against research misconduct.

Concluding remarks

Once fully implemented, ORCID can effectively address the issue of author name ambiguity, thereby enabling efficient researcher-centric retrieval of bibliometric data and thus facilitating the evaluation of research performance at the micro level of individual researchers. Researcher-specific evaluation of research performance should include retracted publications by treating them as a discrediting indicator of research performance, given their contamination of the scientific literature at the least, especially in cases of research misconduct. As evidenced by China’s recent national review of retractions (Mallapaty Citation2024), the Chinese authorities overseeing research integrity urgently need to swiftly track retracted publications and accurately identify the involved researchers. By endorsing and promoting participation in the ORCID-based “Prevention plus Cure” campaign, China can strengthen its combat against research misconduct, foster a culture of accountability, ensure the integrity of its scientific output, and ultimately protect its reputation in the global science community.

However, despite the necessity and the benefits of ORCID adoption, the Chinese science community may face various obstacles in fully embracing ORCID, such as limited resources, dissatisfaction about the current functionality of ORCID, national security concerns related to science and technology, and fear among misbehaving researchers of adverse consequences from ORCID adoption, as discussed above. Another factor influencing ORCID adoption in China may be a collective tendency to overlook breaches of research integrity. As of 2021, the Retraction Watch Database had documented 15,727 retracted publications affiliated to Chinese institutions, accounting for approximately 46% of the worldwide total of 34,223 retractions.Footnote10 In contrast, only 404 publications had been retracted from Chinese academic journals through retraction notices as of 2021 (Xu Citation2023). Despite Chinese academic journals publishing a significantly higher number of research papers by Chinese researchers compared to international journals, they had seen drastically fewer retractions. However, it can be reasonably inferred that if Chinese academic journals had adopted the retraction criteria followed by international journals, a similar, if not greater, number of retractions would have been expected. In light of this situation, if ORCID ends up being abandoned by the Chinese science community, it would become imperative for China to develop its own researcher identification system and mandate nationwide adoption of it. Such a national identification system would enable the effective identification of misbehaving Chinese researchers, providing a powerful tool to curb research misconduct more effectively. However, the contrasting behavior of Chinese researchers in international and domestic journals, as suggested by the significant difference in the number of retractions, raises the question of whether a robust identification system like ORCID or its Chinese equivalent would be truly considered necessary and welcomed in the Chinese science community.

Whether China fully embraces ORCID or decides to establish its own national researcher identification system for any reason, the international science community may not experience a significant impact as long as the “Prevention plus Cure” campaign, hopefully implemented outside of China, covers all the research outputs of Chinese researchers appearing on international publishing platforms. However, if China eventually comes up with a national researcher identification system that does not interoperate with ORCID, it could inconvenience Chinese researchers who publish internationally, as they would have to navigate between two different identification systems. The worst scenario would be that China’s development and implementation of its own researcher identification system could potentially isolate the Chinese science community from the international science community, impeding international collaboration and hindering its progress in science and technology. To promote effective ORCID adoption in China or inform the development and implementation of a Chinese national researcher identification system, it is crucial to conduct empirical research on the current state of ORCID adoption in China. The existing findings on ORCD adoption in China have predominantly relied on observational data and relevant literature (e.g., Pan et al. Citation2020), highlighting the need for empirical research to assess the extent to which various obstacles have impeded ORCID adoption in China. Therefore, future empirical research should analyze the current status of ORCID adoption in China, with a specific focus on key stakeholders, such as prominent Chinese academic journals, major bibliographic databases, and researchers in Double First-Class universities and national research institutions. Further research is also needed to explore the degree to which ORCID can unambiguously identify Chinese authors of retracted publications.

It is worth noting that it took 15 years for DOI to be officially recognized and widely adopted in China. Initially introduced to the Chinese science community in 2000, DOI was only included in China’s National Standards in 2015. Similarly, ORCID was first introduced as an initiative to the Chinese science community in 2010 and was adopted internationally in 2012. However, despite its introduction 12 years ago, the Chinese science community is still hesitant in fully embracing ORCID. It is possible that it may take even longer for the Chinese science community to fully recognize and adopt ORCID compared to DOI. Additionally, the introduction of Research Organization Registry (ROR) in 2015, which aims to distinguish every research organization in the world with a persistent identifier, further emphasizes the importance of persistent identifiers for digitalized international science communication. Surprisingly, at the time of writing this article, our search on CNKI yielded no literature introducing ROR. The three identification systems, namely ROR, ORCID, and DOI, work together to form the infrastructure for digitalized international science communication. It remains to be seen when and how the Chinese science community will adopt all the three identification systems for the benefits of both the Chinese and international science community.

Acknowledgments

We would like to express our sincere appreciation to the ORCID Support team for their prompt and helpful responses to our inquiries regarding ORCID. We would also like to acknowledge the valuable assistance provided by Dr. Huimei Zhang at Huanggang Normal University and Ms. Hongping Xu at Sanjiang University in investigating the ORCID adoption by Chinese academic journals. Furthermore, we are grateful to the three reviewers for their constructive critical feedback, which greatly contributed to the expansion and improvement of the original version. Any errors that remain in this article are solely our own.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The author(s) reported there is no funding associated with the work featured in this article.

Notes

4. In this article, China refers to the mainland of China, excluding Hong Kong, Macau, and Taiwan.

5. The query was conducted on February 4, 2024.

10. The query was conducted in the Retraction Watch Database on February 4, 2024.

References

  • Adams, J., R. Fry, D. Pendlebury, R. Potter, and G. Rogers. 2023. Global Research Report: China’s Research Landscape. Institute for Scientific Information. https://clarivate.com/wp-content/uploads/dlm_uploads/2023/10/XBU1202260247-ISI-GRR-Q4_Chinas-Research-Landscape-Report_DIGITAL.pdf.
  • American Astronomical Society. n.d. “Including Author Names Using Chinese, Japanese, and Korean Characters.” Accessed November 8, 2023. https://journals.aas.org/nonroman/#:~:text=Authors%20have%20the%20option%20to,parentheses%20after%20the%20English%20name.
  • Bagga, A., and B. Baldwin. 1998. “Entity-Based Cross-Document Coreferencing Using the Vector Space Model.” In 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Quebec, Canada, 79–85. Vol. 1.
  • Bai, H. 2015. “Introduction of Integration Between ORCID and Institutional Repository.” New Technology of Library and Information Service 3:8–17. ht tps://d oi.org/CNKI:SUN:XDTQ.10.2015/03/003.
  • Benoit, W. L. 2015. Accounts, Excuses, and Apologies: Image Repair Theory and Research. 2nd ed. Edited by William L. Benoit. Albany, NY: State University of New York Press.
  • Boudry, C. 2021. “Availability of ORCIDs in Publications Archived in PubMed, MEDLINE, and Web of Science Core Collection.” Scientometrics 126 (4): 3355–3371. https://doi.org/10.1007/s11192-020-03825-7.
  • Brown, J., and T. Demeranville. 2018. “A Year in ORBIT.” ORCiD. Last Modified November 28, 2018. Accessed January 27. https://info.orcid.org/a-year-in-orbit/.
  • Butler, D. 2012. “Scientists: Your Number Is Up.” Nature 485 (7400): 564. https://doi.org/10.1038/485564a.
  • Carter, C. B., and C. F. Blanford. 2017. “All Authors Must Now Supply ORCID Identifiers.” Journal of Materials Science 52 (11): 6147–6149. https://doi.org/10.1007/s10853-017-0919-7.
  • Castillo, M. 2009. “Authors’ Names in a Globalized American Journal of Neuroradiology.” American Journal of Neuroradiology 30 (3): 441–442. https://doi.org/10.3174/ajnr.A1367.
  • Chen, R. 2016. “Strengthen Review of authors’ Information to Prevent Academic Cheating: Englightenments from a Case of Suspected Academic Cheating.” Acta Editologica 28 (2): 144–146. https://doi.org/10.16811/j.cnki.1001-4314.2016.02.015.
  • Dai, L. 2006. “Indexing Personal Names.” The Indexer 25 (2).
  • Dai, N. 2021. “Enhance the Safety Consciousness and Hold the Bottom Line of Science and Technology Security.” Acta Editologica 33 (5): 492–496. https://doi.org/10.16811/j.cnki.1001-4314.2021.05.005.
  • Dou, T., S. Zhang, and B. Zhang. 2022. “Current Situation, Risks and Countermeasures of ORCID.” Digital Library Forum 2:47–52. https://doi.org/10.3772/j.issn.1673-2286.2022.02.007.
  • Du, G. 2007. “Name Game: Most Chinese Use 3 Characters, Some Use 10 or More.” China View. Last Modified December 12. Accessed November 7, 2023. http://news.xinhuanet.com/english/2007-12/12/content_7234658.htm.
  • Editorial Committee of Chinese Academic Journals. 1999. Data Norm for Retrieval and Evaluation of Chinese Academic Journal (CD) (CAJ-CD B/T1-1998). Beijing, China: General Administration of Press and Publication.
  • Editorial Committee of Chinese Academic Journals. 2006. Data Norm for Retrieval and Evaluation of Chinese Academic Journal (CD) (CAJ-CD B/T 1-2006). Beijing, China: General Administration of Press and Publication.
  • Flanagin, A., L. A. Carey, P. B. Fontanarosa, S. G. Phillips, B. P. Pace, G. D. Lundberg, and D. Rennie. 1998. “Prevalence of Articles with Honorary Authors and Ghost Authors in Peer-Reviewed Medical Journals.” JAMA 280 (3): 222–224. https://doi.org/10.1001/jama.280.3.222.
  • Gao, X. A., and Y. Zheng. 2018. “‘Heavy mountains’ for Chinese Humanities and Social Science Academics in the Quest for World-Class Universities.” Compare: A Journal of Comparative and International Education 50 (4): 554–572. https://doi.org/10.1080/03057925.2018.1538770.
  • Gasparyan, A. Y., M. Yessirkepov, A. N. Gerasimov, E. I. Kostyukova, and G. D. Kitas. 2016. “Scientific Author Names: Errors, Corrections, and Identity Profiles.” Biochemia Medica 26 (2): 169–173. https://doi.org/10.11613/BM.2016.017.
  • Granshaw, S. I. 2019. “Research Identifiers: ORCID, DOI, and the Issue with Wang and Smith.” Photogrammetric Record 34 (167): 236–243. https://doi.org/10.1111/phor.12290.
  • Haak, L. L. 2014. “iAuthor and ORCID: Supporting International Identifiers for Chinese Researchers at the National Science Library of the Chinese Academies of Sciences.” Last Modified December 3, 2014. Accessed January 31 https://info.orcid.org/iauthor-and-orcid-supporting-international-identifiers-for-chinese-researchers-at-the-national-science-library-of-the-chinese-academies-of-sciences/.
  • Haak, L. L., M. Fenner, L. Paglione, E. Pentz, and H. Ratner. 2012. “ORCID: A System to Uniquely Identify Researchers.” Learned Publishing 25 (4): 259–264. https://doi.org/10.1087/20120404.
  • Haak, L. L., A. Meadows, and J. Brown. 2018. “Using ORCID, DOI, and Other Open Identifiers in Research Evaluation.” Frontiers in Research Metrics and Analytics 3. https://doi.org/10.3389/frma.2018.00028.
  • Harzing, A.-W. 2015. “Health Warning: Might Contain Multiple Personalities—The Problem of Homonyms in Thomson Reuters Essential Science Indicators.” Scientometrics 105 (3): 2259–2270. https://doi.org/10.1007/s11192-015-1699-y.
  • He, Z., ed. 2003. The Hundred Surnames. Qingdao, China: Qingdao Publishing.
  • Household Management Research Center of the Ministry of Public Security. 2021. ““2020 National Name Report” Released.” Last Modified February 8. Accessed November 5, 2023. https://www.mps.gov.cn/n2254314/n6409334/c7726021/content.html.
  • Huang, S., B. Yang, S. Yan, and R. Rousseau. 2014. “Institution Name Disambiguation for Research Assessment.” Scientometrics 99:823–838. https://doi.org/10.1007/s11192-013-1214-2.
  • Huang, Y., Q. Yu, and J. Zhang. 2017. “Research and Realization of Interactions in Multiple Systems Based on ORCID iD.” Library Journal 36 (11): 70–78. https://doi.org/10.13663/j.cnki.lj.2017.11.009.
  • International Organization for Standardization. 1982. “Information and Documentation: Romanization of Chinese (ISO Standard No. 7098: 1982).” https://www.iso.org/standard/13682.html.
  • International Organization for Standardization. 2015. “Information and Documentation: Romanization of Chinese (ISO Standard No. 7098: 2015).” https://www.iso.org/standard/61420.html.
  • Kałużyńska, I. 2015. “Chinese Naming–Substitution by Homophones.” Acta Asiatica Varsoviensia (28):79–91.
  • Liu, Y., H. Shu, and P. Li. 2007. “Word Naming and Psycholinguistic Norms: Chinese.” Behavior Research Methods 39 (2): 192–198. https://doi.org/10.3758/BF03193147.
  • Li, J., Y. Xu, and C. Shi. 2016. “The Application of ORCID in Scientific Journals in China and Its Benefits for Researchers.” Publishing Research 11:42–45. https://doi.org/10.19393/j.cnki.cn11-1537/g2.2016.11.010.
  • Ma, Y., and J. Huang. 2016. “Realization of Application of iAuthor on Research Evaluation of Institutions.” Journal of Library Science in China 42 (222): 97–108. https://doi.org/10.13530/j.cnki.jlis.162007.
  • Mallapaty, S. 2024. “China Conducts First Nationwide Review of Retractions and Research Misconduct.” Nature 626 (8000): 700–701. https://doi.org/10.1038/d41586-024-00397-x.
  • Marcus, A. 2021. “Author, Author! Or Perhaps We Should Say Fake Author, Fake Author!” Retraction Watch (Blog). March 31. https://retractionwatch.com/2021/03/31/author-author-or-perhaps-we-should-say-fake-author-fake-author/.
  • Mauranen, A. 2003. “The Corpus of English As Lingua Franca in Academic Settings.” TESOL Quarterly 37 (3): 513–527. https://doi.org/10.2307/3588402.
  • Moher, D., L. Bouter, S. Kleinert, P. Glasziou, M. Har Sham, V. Barbour, A.-M. Coriat, N. Foeger, and U. Dirnagl. 2020. “The Hong Kong Principles for Assessing Researchers: Fostering Research Integrity.” PLOS Biology 18 (7): e3000737. https://doi.org/10.1371/journal.pbio.3000737.
  • Morillo, F., I. Santabárbara, and J. Aparicio. 2013. “The Automatic Normalisation Challenge: Detailed Addresses Identification.” Scientometrics 95:953–966. https://doi.org/10.1007/s11192-013-0965-0.
  • Notay, B. 2018. “Funders sign up to ORCID Open Letter.” Accessed January 27. Last Modified December 7, 2018. https://org/scholarlycommunications.jiscinvolve.org/wp/2018/12/07/funders-sign-up-to-orcid-open-letter/
  • Oransky, I. 2021. “Researcher Uses Fake Email Address to Submit a Paper Mill Manuscript without Corresponding author’s Knowledge.” Retraction Watch (Blog). December 13. https://retractionwatch.com/2021/12/13/researcher-uses-fake-email-address-to-submit-a-paper-mill-manuscript-without-corresponding-authors-knowledge/.
  • ORCID. 2015. “ORCID in Publications.” https://info.orcid.org/requiring-orcid-in-publications/.
  • ORCID. n.d.-a. “ORCID Member Organizations.” Accessed January 31 https://orcid.org/members.
  • ORCID. n.d.-b. “ORCID Statistics.” Accessed January 31, 2024. https://info.orcid.org/orcid-statistics/.
  • Pan, Q., J. Su, L. Zhang, J. Bai, C. Zhang, and C. Xiao. 2020. “Application Status and Enlightenment of ORCID Marking in Chinese Academic Journals.” Chinese Journal of Scientific and Technical Periodicals 31 (2): 223–228.
  • Perron, B. E., O. T. Hiltz-Perron, and B. G. Victor. 2021. “Revealed: The Inner Workings of a Paper Mill.” Retraction Watch (Blog). December 20. https://retractionwatch.com/2021/12/20/revealed-the-inner-workings-of-a-paper-mill/.
  • Qiu, J. 2008. “Identity Crisis: Chinese Authors Are Publishing More and More Papers, but Are They Receiving Due Credit and Recognition for Their Work? Not if Their Names Get Confused Along the Way.” Nature 451 (7180): 766–768. https://doi.org/10.1038/451766a.
  • Sanyal, D. K., K. P. Bhomick, and P. P. Das. 2021. “A Review of Author Name Disambiguation Techniques for the PubMed Bibliographic Database.” Journal of Information Science 47 (2): 227–254. https://doi.org/10.1177/0165551519888605.
  • Shin, D., T. Kim, J. Choi, and J. Kim. 2014. “Author Name Disambiguation Using a Graph Model with Node Splitting and Merging Based on Bibliographic Information.” Scientometrics 100:15–50. https://doi.org/10.1007/s11192-014-1289-4.
  • Smalheiser, N. R., and V. I. Torvik. 2009. “Author Name Disambiguation.” Annual Review of Information Science and Technology 43 (1): 1–43. https://doi.org/10.1002/aris.2009.1440430113.
  • State Council of China. 1978. The State Council Approved the Report on the Use of the Chinese Pinyin Scheme As the Unified Standard for the Roman Alphabet Spelling of Names of People and Places in China.
  • Strotmann, A., and D. Zhao. 2012. “Author Name Disambiguation: What Difference Does it Make in Author-Based Citation Analysis?” Journal of the American Society for Information Science and Technology 63 (9): 1820–1833. https://doi.org/10.1002/asi.22695.
  • Sun, T., J. Huang, J. Zhang, and Y. Huang. 2015. “Study on the International Identifiers for Scientists.” Library and Information Service 59 (1): 17–22; 44. https://doi.org/10.13266/j.issn.0252-3116.2015.01.002.
  • Sun, X., and J. Zhou. 2002. “English Versions of Chinese authors’ Names in Biomedical Journals: Observations and Recommendations.” Science Editor 25 (1): 3–4.
  • Tang, L., and J. Walsh. 2010. “Bibliometric Fingerprints: Name Disambiguation Based on Approximate Structure Equivalence of Cognitive Maps.” Scientometrics 84 (3): 763–784. https://doi.org/10.1007/s11192-010-0196-6.
  • Tang, L., L. Wang, and G. Hu. 2023. “Research Misconduct Investigations in China’s Science Funding System.” Science and Engineering Ethics 29 (6). https://doi.org/10.1007/s11948-023-00459-9.
  • Tardy, C. 2004. “The Role of English in Scientific Communication: Lingua Franca or Tyrannosaurus Rex?” Journal of English for Academic Purposes 3 (3): 247–269. https://doi.org/10.1016/j.jeap.2003.10.001.
  • Torvik, V. I., and N. R. Smalheiser. 2009. “Author Name Disambiguation in MEDLINE.” ACM Transactions on Knowledge Discovery from Data 3 (3): 1–29.
  • To, W. M., and B. T. Yu. 2020. “Rise in Higher Education Researchers and Academic Publications.” Emerald Open Research 2:3. https://doi.org/10.35241/emeraldopenres.13437.1.
  • van Noorden, R. 2023. “More Than 10,000 Research Papers Were Retracted in 2023 - a New Record.” Nature. https://doi.org/10.1038/d41586-023-03974-8.
  • Wang, H., J. Huang, and X. Liu. 2015. “The Application Research of ORCID in Institutional Repository Construction.” Library and Information Service 59 (17): 134–140. https://doi.org/10.13266/j.issn.0252-3116.2015.17.020.
  • Wei, Z. 2015. “The Application of the International Academic Identifier ORCID in Chinese Scientific Journals.” Research & Education 5:101–104. https://doi.org/10.16510/j.cnki.kjycb.2015.05.029.
  • Woolston, C. 2023. “Nature Index Annual Tables 2023: China Tops Natural-Science Table.” Nature. https://doi.org/10.1038/d41586-023-01868-3.
  • Xu, H. 2023. “Misconduct Performances and Suggestions for Processing Retracted Papers of Chinese Journal Databases.” Chinese Journal of Scientific and Technical Periodicals 34 (11): 1554–1561. https://doi.org/10.11946/cjstp.202308150620.
  • Xu, S. B., and G. Hu. 2022. “Retraction Stigma and Its Communication via Retraction Notices.” Minerva 60 (3): 349–374. https://doi.org/10.1007/s11024-022-09465-w.
  • Youtie, J., S. Carley, A. L. Porter, and P. Shapira. 2017. “Tracking Researchers and Their Outputs: New Insights from ORCIDs.” Scientometrics 113:437–453. https://doi.org/10.1007/s11192-017-2473-0.
  • Zheng, Y., and X. A. Gao. 2016. “Chinese Humanities and Social Sciences scholars’ Language Choices in International Scholarly Publishing: A ten-Year Survey.” Journal of Scholarly Publishing 48 (1): 1–16. https://doi.org/10.3138/jsp.48.1.1.