Â鶹´«Ã½É«ÇéƬ

News

When scientific citations go rogue: Uncovering ‘sneaked references’

Lonni Besançon Guillaume Cabanac
By Lonni Besançon and Guillaume Cabanac
Aug. 10, 2024

A researcher working alone – apart from the world and the rest of the wider scientific community – is a classic yet misguided image. Research is, in reality, built on continuous exchange within the scientific community: First you understand the work of others, and then you share your findings.

Reading and writing articles published in academic journals and presented at conferences is a central part of being a researcher. When researchers write a scholarly article, they must cite the work of peers to provide context, detail sources of inspiration and explain differences in approaches and results. A positive citation by other researchers is a key measure of visibility for a researcher’s own work.

But what happens when this citation system is manipulated? A by our team of academic sleuths – which includes information scientists, a computer scientist and a mathematician – has revealed an insidious method to artificially inflate citation counts through metadata manipulations: sneaked references.

Hidden manipulation

People are becoming more aware of scientific publications and how they work, including their potential flaws. Just last year more than . The issues around citation gaming and the harm it causes the scientific community, including damaging its credibility, are well documented.

Citations of scientific work abide by a standardized referencing system: Each reference explicitly mentions at least the title, authors’ names, publication year, journal or conference name, and page numbers of the cited publication. These details are stored as metadata, not visible in the article’s text directly, but assigned to a digital object identifier, or DOI – a unique identifier for each scientific publication.

References in a scientific publication allow authors to justify methodological choices or present the results of past studies, highlighting the iterative and collaborative nature of science.

However, we found through a chance encounter that some unscrupulous actors have added extra references, invisible in the text but present in the articles’ metadata, when they submitted the articles to scientific databases. The result? Citation counts for certain researchers or journals have skyrocketed, even though these references were not cited by the authors in their articles.

Chance discovery

The investigation began when Guillaume Cabanac, a professor at the University of Toulouse, wrote a post on , a website dedicated to postpublication peer review, in which scientists discuss and analyze publications. In the post, he detailed how he had noticed an inconsistency: a Hindawi journal article that he suspected was fraudulent because it contained awkward phrases had far more citations than downloads, which is very unusual.

The post caught the attention of several sleuths who are now the authors of the . We used a scientific search engine to look for articles citing the initial article. Google Scholar found none, but Crossref and Dimensions did find references. The difference? Google Scholar is likely to mostly rely on the article’s main text to extract the references appearing in the bibliography section, whereas Crossref and Dimensions use metadata provided by publishers.

A new type of fraud

To understand the extent of the manipulation, we examined three scientific journals that were published by the Technoscience Academy, the publisher responsible for the articles that contained questionable citations.

Our investigation consisted of three steps:

  1. We listed the references explicitly present in the HTML or PDF versions of an article.

  2. We compared these lists with the metadata recorded by Crossref, discovering extra references added in the metadata but not appearing in the articles.

  3. We checked Dimensions, a bibliometric platform that uses Crossref as a metadata source, finding further inconsistencies.

In the journals published by Technoscience Academy, at least 9% of recorded references were “sneaked references.” These additional references were only in the metadata, distorting citation counts and giving certain authors an unfair advantage. Some legitimate references were also lost, meaning they were not present in the metadata.

In addition, when analyzing the sneaked references, we found that they highly benefited some researchers. For example, a single researcher who was associated with Technoscience Academy benefited from more than 3,000 additional illegitimate citations. Some journals from the same publisher benefited from a couple hundred additional sneaked citations.

We wanted our results to be externally validated, so we posted our study , informed both Crossref and Dimensions of our findings and gave them a link to the preprinted investigation. Dimensions acknowledged the illegitimate citations and confirmed that their database reflects Crossref’s data. Crossref the extra references in and highlighted that this was the first time that it had been notified of such a problem in its database. The publisher, based on Crossref’s investigation, has taken action to fix the problem.

Implications and potential solutions

Why is this discovery important? Citation counts heavily influence research funding, academic promotions and institutional rankings. Manipulating citations can lead to unjust decisions based on false data. More worryingly, this discovery raises questions about the integrity of scientific impact measurement systems, a concern that has been highlighted by researchers for years. These systems can be manipulated to foster unhealthy competition among researchers, tempting them to take shortcuts to publish faster or achieve more citations.

To combat this practice we suggest several measures:

  • Rigorous verification of metadata by publishers and agencies like Crossref.

  • Independent audits to ensure data reliability.

  • Increased transparency in managing references and citations.

This study is the first, to our knowledge, to report a manipulation of metadata. It also discusses the impact this may have on the evaluation of researchers. The study highlights, yet again, that the overreliance on metrics to evaluate researchers, their work and their impact may be inherently flawed and wrong.

Such overreliance is likely to promote questionable research practices, including hypothesizing after the results are known, or ; splitting a single set of data into several papers, known as salami slicing; data manipulation; and plagiarism. It also hinders the transparency that is key to more and research. Although the problematic citation metadata and sneaked references have now been apparently fixed, the corrections may have, as is , happened too late.

This article is published in collaboration with , a blog for understanding digital issues.

This article is republished from under a Creative Commons license. Read the .

Enjoy reading ASBMB Today?

Become a member to receive the print edition four times a year and the digital edition weekly.

Learn more
Lonni Besançon
Lonni Besançon

Lonni Besançon is an assistant professor in data visualization at Linköping University.

Guillaume Cabanac
Guillaume Cabanac

Guillaume Cabanac is a university professor at the Institut de Recherche en Informatique de Toulouse.

Featured jobs

from the

Get the latest from ASBMB Today

Enter your email address, and we’ll send you a weekly email with recent articles, interviews and more.

Latest in Careers

Careers highlights or most popular articles

Upcoming opportunities
Announcement

Upcoming opportunities

Nov. 28, 2024

Friendly reminder: Book a recruiter table at ASBMB's career and education fair by Nov. 30 to secure early-bird pricing! Just added: Applications are being accepted for a post-bac at Dartmouth Cancer Center.

Upcoming opportunities
Announcement

Upcoming opportunities

Nov. 21, 2024

Just added: Register for ASBMB's virtual session on thriving in challenging academic or work environments.

Who decides when a grad student graduates?
Training

Who decides when a grad student graduates?

Nov. 15, 2024

Ph.D. programs often don’t have a set timeline. Students continue with their research until their thesis is done, which is where variability comes into play.

Upcoming opportunities
Announcement

Upcoming opportunities

Nov. 14, 2024

Submit an abstract for ASBMB's meeting on ferroptosis!

Join the pioneers of ferroptosis at cell death conference
In-person Conference

Join the pioneers of ferroptosis at cell death conference

Nov. 13, 2024

Meet Brent Stockwell, Xuejun Jiang and Jin Ye — the co-chairs of the ASBMB’s 2025 meeting on metabolic cross talk and biochemical homeostasis research.

A brief history of the performance review
Jobs

A brief history of the performance review

Nov. 8, 2024

Performance reviews are a widely accepted practice across all industries — including pharma and biotech. Where did the practice come from, and why do companies continue to require them?