Â鶹´«Ã½É«ÇéƬ

News

When scientific citations go rogue: Uncovering ‘sneaked references’

Lonni Besançon Guillaume Cabanac
By Lonni Besançon and Guillaume Cabanac
Aug. 10, 2024

A researcher working alone – apart from the world and the rest of the wider scientific community – is a classic yet misguided image. Research is, in reality, built on continuous exchange within the scientific community: First you understand the work of others, and then you share your findings.

Reading and writing articles published in academic journals and presented at conferences is a central part of being a researcher. When researchers write a scholarly article, they must cite the work of peers to provide context, detail sources of inspiration and explain differences in approaches and results. A positive citation by other researchers is a key measure of visibility for a researcher’s own work.

But what happens when this citation system is manipulated? A by our team of academic sleuths – which includes information scientists, a computer scientist and a mathematician – has revealed an insidious method to artificially inflate citation counts through metadata manipulations: sneaked references.

Hidden manipulation

People are becoming more aware of scientific publications and how they work, including their potential flaws. Just last year more than . The issues around citation gaming and the harm it causes the scientific community, including damaging its credibility, are well documented.

Citations of scientific work abide by a standardized referencing system: Each reference explicitly mentions at least the title, authors’ names, publication year, journal or conference name, and page numbers of the cited publication. These details are stored as metadata, not visible in the article’s text directly, but assigned to a digital object identifier, or DOI – a unique identifier for each scientific publication.

References in a scientific publication allow authors to justify methodological choices or present the results of past studies, highlighting the iterative and collaborative nature of science.

However, we found through a chance encounter that some unscrupulous actors have added extra references, invisible in the text but present in the articles’ metadata, when they submitted the articles to scientific databases. The result? Citation counts for certain researchers or journals have skyrocketed, even though these references were not cited by the authors in their articles.

Chance discovery

The investigation began when Guillaume Cabanac, a professor at the University of Toulouse, wrote a post on , a website dedicated to postpublication peer review, in which scientists discuss and analyze publications. In the post, he detailed how he had noticed an inconsistency: a Hindawi journal article that he suspected was fraudulent because it contained awkward phrases had far more citations than downloads, which is very unusual.

The post caught the attention of several sleuths who are now the authors of the . We used a scientific search engine to look for articles citing the initial article. Google Scholar found none, but Crossref and Dimensions did find references. The difference? Google Scholar is likely to mostly rely on the article’s main text to extract the references appearing in the bibliography section, whereas Crossref and Dimensions use metadata provided by publishers.

A new type of fraud

To understand the extent of the manipulation, we examined three scientific journals that were published by the Technoscience Academy, the publisher responsible for the articles that contained questionable citations.

Our investigation consisted of three steps:

  1. We listed the references explicitly present in the HTML or PDF versions of an article.

  2. We compared these lists with the metadata recorded by Crossref, discovering extra references added in the metadata but not appearing in the articles.

  3. We checked Dimensions, a bibliometric platform that uses Crossref as a metadata source, finding further inconsistencies.

In the journals published by Technoscience Academy, at least 9% of recorded references were “sneaked references.” These additional references were only in the metadata, distorting citation counts and giving certain authors an unfair advantage. Some legitimate references were also lost, meaning they were not present in the metadata.

In addition, when analyzing the sneaked references, we found that they highly benefited some researchers. For example, a single researcher who was associated with Technoscience Academy benefited from more than 3,000 additional illegitimate citations. Some journals from the same publisher benefited from a couple hundred additional sneaked citations.

We wanted our results to be externally validated, so we posted our study , informed both Crossref and Dimensions of our findings and gave them a link to the preprinted investigation. Dimensions acknowledged the illegitimate citations and confirmed that their database reflects Crossref’s data. Crossref the extra references in and highlighted that this was the first time that it had been notified of such a problem in its database. The publisher, based on Crossref’s investigation, has taken action to fix the problem.

Implications and potential solutions

Why is this discovery important? Citation counts heavily influence research funding, academic promotions and institutional rankings. Manipulating citations can lead to unjust decisions based on false data. More worryingly, this discovery raises questions about the integrity of scientific impact measurement systems, a concern that has been highlighted by researchers for years. These systems can be manipulated to foster unhealthy competition among researchers, tempting them to take shortcuts to publish faster or achieve more citations.

To combat this practice we suggest several measures:

  • Rigorous verification of metadata by publishers and agencies like Crossref.

  • Independent audits to ensure data reliability.

  • Increased transparency in managing references and citations.

This study is the first, to our knowledge, to report a manipulation of metadata. It also discusses the impact this may have on the evaluation of researchers. The study highlights, yet again, that the overreliance on metrics to evaluate researchers, their work and their impact may be inherently flawed and wrong.

Such overreliance is likely to promote questionable research practices, including hypothesizing after the results are known, or ; splitting a single set of data into several papers, known as salami slicing; data manipulation; and plagiarism. It also hinders the transparency that is key to more and research. Although the problematic citation metadata and sneaked references have now been apparently fixed, the corrections may have, as is , happened too late.

This article is published in collaboration with , a blog for understanding digital issues.

This article is republished from under a Creative Commons license. Read the .

Enjoy reading ASBMB Today?

Become a member to receive the print edition monthly and the digital edition weekly.

Learn more
Lonni Besançon
Lonni Besançon

Lonni Besançon is an assistant professor in data visualization at Linköping University.

Guillaume Cabanac
Guillaume Cabanac

Guillaume Cabanac is a university professor at the Institut de Recherche en Informatique de Toulouse.

Featured jobs

from the

Get the latest from ASBMB Today

Enter your email address, and we’ll send you a weekly email with recent articles, interviews and more.

Latest in Careers

Careers highlights or most popular articles

How do you help a biochemist find a career path?
Essay

How do you help a biochemist find a career path?

Sept. 18, 2024

Industry, academia and the ASBMB join forces to introduce students job options in the sciences with a panel, networking and cheese.

ASBMB seeks feedback on NIH postdoc training questions
Training

ASBMB seeks feedback on NIH postdoc training questions

Sept. 18, 2024

The National Institutes of Health takes steps toward addressing concerns about support caps, a funding mechanism and professional development.

Making the most of meetings with your mentor
Advice

Making the most of meetings with your mentor

Sept. 13, 2024

Everyone has a slightly different relationship with their mentor, including how often they meet. Careers columnist Courtney Chandler dives into how to make meetings with your grad school adviser useful and productive.

Growing a chapter for grad students and postdocs
Society News

Growing a chapter for grad students and postdocs

Sept. 12, 2024

At Penn State, the ASBMB is building a community to help provide these early-career researchers with the tools they need to excel in science and life.

Upcoming opportunities
Announcement

Upcoming opportunities

Sept. 12, 2024

Celebrate NPAW with ASBMB! Plus: Present your work at our virtual event on exploring AI tools in BMB education.

Upcoming ASBMB events for future industry scientists — and others
Webinars

Upcoming ASBMB events for future industry scientists — and others

Sept. 6, 2024

ASBMB offers webinars aimed at a variety of member groups, including scientists who have taken time off for caregiving or members who want to start their own labs. These three will be useful to postdocs and graduate students.