July 10th, 2024

Researchers discover a new form of scientific fraud: 'sneaked references'

Researchers identify "sneaked references" as a new form of scientific fraud, artificially boosting citation counts. Concerns arise over integrity in research evaluation systems, suggesting measures for verification and transparency. Manipulation distorts research impact assessment.

Read original articleLink Icon
Researchers discover a new form of scientific fraud: 'sneaked references'

Researchers have uncovered a new form of scientific fraud known as "sneaked references," where extra references are added to articles' metadata but not cited in the text. This manipulation artificially inflates citation counts, giving certain researchers an unfair advantage. The discovery highlights concerns about the integrity of scientific impact measurement systems and the potential for manipulation to influence research funding and academic promotions. The study suggests measures such as rigorous verification of metadata, independent audits, and increased transparency in managing references to combat this practice. The investigation found that at least 9% of recorded references in certain journals were "sneaked references," distorting citation counts. The implications of this manipulation extend to promoting questionable research practices and hindering transparency in scientific evaluation. The study emphasizes the need to address overreliance on metrics in evaluating researchers and their impact to ensure the integrity of scientific research.

Related

The case for criminalizing scientific misconduct · Chris Said

The case for criminalizing scientific misconduct · Chris Said

The article argues for criminalizing scientific misconduct, citing cases like Sylvain Lesné's fake research. It proposes Danish-style committees and federal laws to address misconduct effectively, emphasizing accountability and public trust protection.

Journal retracts all 23 articles in special issue

Journal retracts all 23 articles in special issue

A journal retracted 23 articles from a special issue due to compromised peer review. Guest editor Abbas Mardani didn't comment. Authors criticized lack of transparency and faced consequences. Publisher Springer mentioned ongoing investigations.

Elsevier withdraws plagiarized paper after author calls journal out on LinkedIn

Elsevier withdraws plagiarized paper after author calls journal out on LinkedIn

Elsevier retracted a plagiarized paper titled "Optimizing smart building energy systems" after Sadrizadeh exposed copied data. Editor-in-chief investigated and withdrew the paper due to possible figure manipulation. Lead author did not comment.

Research into homeopathy: data falsification, fabrication and manipulation

Research into homeopathy: data falsification, fabrication and manipulation

Research on homeopathy faces credibility issues due to data manipulation in a study led by Michael Frass. The study, once positive, now raises concerns of scientific misconduct, urging withdrawal of publication. Challenges persist in alternative medicine research, highlighting the conflict between ideology and scientific integrity.

Ranking Fields by p-Value Suspiciousness

Ranking Fields by p-Value Suspiciousness

The article addresses p-hacking in research, focusing on suspicious p-value clustering across fields. Economics shows more credible results. Replication crisis is universal, emphasizing the call for research integrity and transparency.

Link Icon 19 comments
By @generationP - 5 months
More info at https://retractionwatch.com/2023/10/09/how-thousands-of-invi... and https://arxiv.org/abs/2310.02192 . As the latter makes clear, this type of fraud is most likely done by journal editors (or their assistants), not by the authors:

> When registering a new publication and its references at Crossref, a publisher may sneak extra undue references in the metadata sent in addition to the ones originally present. Then, digital libraries (e.g., SpringerLink) and bibliometric platforms (e.g., Dimensions) harvest these metadata, undue citations included. These sneaked references are processed and counted even if they are not present in the original publication.

The three journals in this particular case are all published by Technoscience Academy, an OA publisher operating out of India (not one of the well-known ones). I would think twice as an author before I submitted to any journal from this publisher, lest my paper is abused for manipulations like this (although I'm not sure if it has any journals worth submitting to anyway).

NB (because I got confused first): This is not really about Hindawi. Hindawi published the (trash) article that these fake citations were pumping up, but the pumping-up happened using Technoscience Academy journals.

By @ilamont - 5 months
Article doesn't really address why this is happening. My guess would be financial incentives for researchers and professors to publish in international journals, a common practice in some countries. For instance, according to "Analysis of Chinese universities’ financial incentives for academic publications":

In recent years payments based directly on the number of citations a paper receives have become more popular, but are still much less common than those based on the journal’s impact factor.

https://opportunities-insight.britishcouncil.org/insights-bl...

By @taeric - 5 months
Hilarious to me how this is basically, "researchers discover SEO techniques."
By @Isamu - 5 months
They mean references cited in the metadata but not in the actual paper. So it’s “invisible” and can be gamed because some citation trackers rely on the metadata rather than having to parse the paper.
By @phyzix5761 - 5 months
Why did it take the article 7 paragraphs to tell me what the fraud was? Ad revenue driven journalism.
By @skyechurch - 5 months
>Some legitimate references were also lost, meaning they were not present in the metadata.

It's possible that some of the inconsistency between metadata and text could just be due to incompetence - it's harder to find a profit motive for dropping legitimate citations. Why wouldn't this sort of metadata auto-generated from the text (aside from enabling fraud, of course)?

By @probably_wrong - 5 months
I'm unclear on whether to pin this on the publisher or on the authors.

In the first example shown in the linked pre-print [1] there's a paper with 62 downloads that's been cited 107 times within two months. The pre-print looks deeper into a paper with 7 "real" references whose metadata has an extra 40 references not found in the PDF. This leaves us with three options:

  * the author of a paper with 62 downloads (not an amazing number) was convinced into joining a citation ring along with 40 other authors,
  * the publisher has been sneaking references onto unsuspecting papers, or
  * the publisher has a vulnerability on their metadata system that's being actively exploited by the two scholars identified in the pre-print.
Whatever the case, I'm glad the solution is as simple as "you should parse the references yourself". I do however wonder: is someone checking whether all of the references are actually referenced within the paper?

[1] https://arxiv.org/pdf/2310.02192

By @rdtsc - 5 months
So how does the fraud work? Researcher wants to boost his citation count so they can get more funding, respect, etc. They ask their friends to cite their paper in a metadata-only reference in their other papers, even though the papers didn't really reference anything from the original paper.

They should be able to find citation "rings" then, whole groups which regularly do this, probably associated with specific institutions or journals.

The linked study did part of this: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.2489...

> An analysis of the 10 sneaked references in Dimensions reveals that they benefit mainly two authors (Initials JNR & BK)

Now, it would be interesting to see if JNR and BK's publications used this trick and in turn benefitted, some other group.

By @banga - 5 months
In most cases the published articles is document that has some additional metadata added during the publishing process. Hence, there's an inherent disconnect between the article and the metadata that must be examined for accuracy and truth.

Instead, the published article should really be a view of the structured data, metadata, and text (i.e. the true content) that makes up the article. Formatting and such can be a pain, but using this approach would mean the published article is a view of the truth rather than the metadata being created as something of an afterthought.

By @saulrh - 5 months
Extracting the key definition:

  These additional references were only in the
  metadata, distorting citation counts and
  giving certain authors an unfair advantage.
Papers with metadata that doesn't match the contents of the paper. The article notes that Google Scholar is unaffected, as it extracts citations from the paper itself by parsing the text of the printed bibliography.
By @ggm - 5 months
Why isn't Hanlon's razor applicable here? Maybe for a significant number it is, and maybe not all.

The problem would be if this turns into a negative index, it can have equally bad repercussions. So attribution to malice and intent is important because there will be people adversely affected.

If this is a publisher/SEO fuckup, that needs to be seen distinct from "fraud"

By @gred - 5 months
Something similar hit the news in Spain recently:

https://cadenaser.com/castillayleon/2024/03/15/el-candidato-...

By @EricE - 5 months
By @spullara - 5 months
so is the implication those cited are paying the publishers to add them to the metadata of the papers they publish? what is the actual mechanism?
By @goodcharles - 5 months
Its SEO spam link building for academics
By @meltyness - 5 months
"I'm In!"
By @guender - 5 months
This would only be relevant for shit journals that use authors’ pdfs directly and don’t do their own processing.
By @motohagiography - 5 months
feels like a neural net could detect these by scoring and ranking the relevance of the content of a reference paper to the content of the paper citing it. maybe it's time for a citizen science project to dismantle these academic fraud rings? they form networks that capture academic administrations and have significant downstream effects on policy and education. just by identifying the worst and most egregious offenders, leaving the merely dodgy alone, it could break up the hold they have on institutions.