TechBriefly
Avoid these words at all costs if you don’t want to get caught using AI


Research shows at least 10% of scientific abstracts in 2024 were either generated or significantly assisted by LLMs. Here are the words that give them away most often.

by Emre Çıtak
11 July 2024
in AI

Detecting AI-generated text has long been a challenge for researchers and developers. As large language models (LLMs) such as Google’s Gemini Advanced and OpenAI’s GPT-4o advance rapidly, the text they produce has become increasingly difficult to tell apart from human writing.

However, a new study from researchers at the University of Tübingen and Northwestern University offers a breakthrough in identifying AI-crafted content.

By focusing on the sudden surge in specific vocabulary in scientific writing, they have developed a method to detect the use of LLMs with surprising accuracy. This technique, inspired by pandemic studies that measured excess deaths, reveals how changes in word usage can signal the presence of AI-generated text.


What are the words that give AI content away?

To measure these changes, the team tracked how often each word appeared in abstracts each year. By comparing the frequency expected from pre-2023 trends with actual usage in 2023 and 2024, they identified a dramatic increase in certain terms. For example, the word “delves” appeared 25 times more often in 2024 abstracts than expected. Similarly, “showcasing” and “underscores” each saw a ninefold increase in usage.
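The comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the researchers' code: the straight-line extrapolation, the function names, and the sample frequencies are all assumptions made for the example. It reports both a ratio (observed divided by expected) and a gap (observed minus expected).

```python
def expected_frequency(freq_by_year, target_year, baseline_years):
    """Fit a least-squares line through the baseline years and
    extrapolate it to target_year (an assumed, simplified trend model)."""
    xs = list(baseline_years)
    ys = [freq_by_year[y] for y in xs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    # A frequency cannot be negative, so clamp the projection at zero.
    return max(mean_y + slope * (target_year - mean_x), 0.0)


def excess_usage(freq_by_year, target_year, baseline_years):
    """Return (expected, gap, ratio) for one word in target_year."""
    expected = expected_frequency(freq_by_year, target_year, baseline_years)
    observed = freq_by_year[target_year]
    gap = observed - expected
    ratio = observed / expected if expected > 0 else float("inf")
    return expected, gap, ratio


# Hypothetical share of abstracts containing one word, by year.
sample = {2020: 0.001, 2021: 0.002, 2022: 0.003, 2024: 0.05}
expected, gap, ratio = excess_usage(sample, 2024, [2020, 2021, 2022])
print(f"expected={expected:.4f} gap={gap:.4f} ratio={ratio:.1f}")
```

A ratio is most informative for rare words such as “delves”, while a gap is more informative for common words such as “potential”, where even a small relative change affects many abstracts.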

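Here are the words most associated with AI-generated text, with the reported increase in usage. The first three figures are multiples of the expected frequency, and the next three are increases in percentage points: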

  • Delves – 25 times increase
  • Showcasing – 9 times increase
  • Underscores – 9 times increase
  • Potential – 4.1 percentage points increase
  • Findings – 2.7 percentage points increase
  • Crucial – 2.6 percentage points increase
  • Across – significant increase (exact rate not specified)
  • Additionally – significant increase (exact rate not specified)
  • Comprehensive – significant increase (exact rate not specified)
  • Enhancing – significant increase (exact rate not specified)
  • Exhibited – significant increase (exact rate not specified)
  • Insights – significant increase (exact rate not specified)
  • Notably – significant increase (exact rate not specified)
  • Particularly – significant increase (exact rate not specified)
  • Within – significant increase (exact rate not specified)

These words have become telltale signs of AI involvement, showing up far more frequently than expected. While language evolves naturally, such abrupt changes are unusual and often tied to major global events.

In this case, the widespread use of LLMs has led to a noticeable shift in the vocabulary of scientific literature.

Inspiration from pandemic analysis

The researchers’ approach draws heavily from techniques used during the COVID-19 pandemic. Just as excess deaths were calculated by comparing observed fatalities to historical data, this study compares current word usage against historical trends to identify anomalies. They analyzed over 14 million scientific abstracts published on PubMed from 2010 to 2024, identifying a significant uptick in certain words starting in late 2022, coinciding with the broader adoption of LLMs.
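Before any trend can be compared, each word's yearly usage has to be counted. The sketch below shows one way this could be done, assuming that "frequency" means the share of abstracts in a given year that contain the word at least once. The tokenizer, the function name, and the sample abstracts are hypothetical and are not taken from the study.

```python
import re
from collections import Counter, defaultdict

# Very simple tokenizer: lowercase runs of ASCII letters only (an assumption).
TOKEN = re.compile(r"[a-z]+")


def yearly_document_frequency(abstracts):
    """abstracts: iterable of (year, text) pairs.

    Returns {year: {word: share of that year's abstracts containing word}}.
    """
    totals = Counter()
    containing = defaultdict(Counter)
    for year, text in abstracts:
        totals[year] += 1
        # Use a set so a word counts once per abstract, however often it repeats.
        for word in set(TOKEN.findall(text.lower())):
            containing[year][word] += 1
    return {
        year: {word: count / totals[year] for word, count in counts.items()}
        for year, counts in containing.items()
    }


# Hypothetical mini-corpus for illustration.
corpus = [
    (2021, "We study cells."),
    (2021, "This paper delves into cells."),
    (2024, "This study delves into pathways and delves further."),
]
freqs = yearly_document_frequency(corpus)
print(freqs[2021]["delves"], freqs[2024]["delves"])
```

The resulting per-year shares are the kind of series that can then be compared against a pre-2023 trend.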

The researchers noted that the rise in specific words, termed “marker words,” is a clear indicator of LLM usage. This phenomenon differs from past vocabulary shifts linked to events like the COVID-19 pandemic, which saw an increase in noun-heavy language.


In contrast, the post-LLM period has seen a spike in verbs, adjectives, and adverbs. This shift highlights how AI-generated text subtly changes the texture and style of writing.

By identifying these marker words, the researchers estimate that at least 10% of scientific abstracts in 2024 were either generated or significantly assisted by LLMs. This estimate is likely conservative, as not all AI-assisted texts will contain these specific markers. Nonetheless, the presence of these words provides a reliable metric for detecting AI influence in academic writing.
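One simple way to see why such an estimate is a lower bound: if a word appears in a given share of abstracts beyond what the trend predicts, and that excess is attributed entirely to LLMs, then at least that share of abstracts was touched by an LLM. The sketch below applies this reasoning to the three percentage-point figures quoted earlier in this article. The function name is hypothetical, and this single-word bound is an illustrative simplification; the article does not describe how the researchers combined words to reach their 10% figure.

```python
def lower_bound_llm_share(gaps_pp):
    """gaps_pp: {word: excess usage in percentage points}.

    Returns the (word, gap) pair with the largest gap. Under the assumption
    that the excess is caused only by LLM use, that gap is a lower bound on
    the percentage of abstracts affected.
    """
    word = max(gaps_pp, key=gaps_pp.get)
    return word, gaps_pp[word]


# Percentage-point increases as reported in this article.
reported_gaps = {"potential": 4.1, "findings": 2.7, "crucial": 2.6}
word, bound = lower_bound_llm_share(reported_gaps)
print(f"At least {bound} percent of abstracts, based on '{word}' alone")
```

A bound built from a single word is necessarily weaker than one that draws on many marker words at once, which is consistent with the study's higher figure.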

Geographical trends in LLM usage

The study also uncovered geographical variations in the adoption of LLMs. Papers from countries such as China, South Korea, and Taiwan showed a higher frequency of marker words, suggesting that LLMs are particularly valuable for non-native English speakers. These tools help authors refine their writing, making it more polished and publication-ready.

Conversely, native English speakers may be more skilled at recognizing and eliminating these markers, thereby concealing their use of AI. This difference suggests that while LLMs are widely used across the globe, their impact is more pronounced in regions where English is not the primary language.


