Sapling Logo

AI Detector



Detect whether text is AI-generated.

This free AI writing detector outputs the probability that a text is AI-generated by a model such as ChatGPT or Gemini. This can be helpful for educators, SEO practitioners, and reviewers of user-generated content.

Tags: ai detector ai checker chatgpt detector ai content detector gpt-4 detector ai text detector



Developed by former researchers at:
Cal Stanford
Meta Google

Instructions

Type or paste text above to score. Note that the AI detector becomes much more accurate after 50 or so words. The token count (approximately the word count) will be shown as part of the score output.

No current AI content detector (including Sapling's) should be used as a standalone check to determine whether text is AI-generated or written by a human. False positives and false negatives will occur.

The top section will show the overall score and highlight portions of the text that appear to be AI-generated.

The bottom section will highlight individual sentences that may be AI-generated due to low measures of perplexity (sentences that are cliché or simplistic will be flagged).

The detector for the entire text and the per-sentence detector use complementary techniques, so use them together (along with your best judgment) to make an assessment.

Looking for other ways to score content? Contact us.


Changelog

New!

  • Better text normalization.
  • Versioning of different detector versions (available in API).
  • Significantly better performance on recent ChatGPT models.

Recent

  • Better handling of whitespace.
  • Improved robustness to small changes.
  • Better handling of different LLMs.

Coming soon

  • Improved support for AI-generated code and technical content (currently the system tries to avoid predictions for code).
  • Increased text length limit (8K characters currently available for Pro users).




Frequently Asked Questions

Recently, models such as ChatGPT (GPT-4), Bing Chat, Claude, and Google's Gemini have led to the rise of machine-generated content. This synthetic content is increasingly indistinguishable from human-written content.

Despite rapid progress, these models continue to have shortcomings such as hallucinated facts as well as consequences such as enabling cheating in language courses.

This AI checker tool provides a way of screening whether a piece of content is written by a human or machine.

The AI detector uses a machine learning system (a Transformer) similar to that used to generate AI content. Instead of generating words, the AI detector instead generates the probability it thinks each word or token in the input text is AI-generated or not. The results are visualized above for the entire text as well as for each sentence.

Yes! You can install the AI Content Detector for ChatGPT extension by Sapling.ai. This extension will allow you to check for AI-generated content anywhere on the web. Select text on any webpage, then click the Detect AI button to see a complete analysis of the selected text.

When using chatbots such as ChatGPT, the Detect AI button will be embedded next to each generated result allowing you to easily run AI detection with a single click.

You will also be able to edit the analyzed text and recheck your work. This will allow you to easily fix the sections that have been flagged as AI-generated.

Accuracy must be measured on a specific test or benchmark. There are also multiple measurements of "accuracy" for detection tools. These measurements balance catching as many AI-generated texts as possible while keeping false positives low. On our internal benchmarks, Sapling catches more than 97% of AI-generated texts while keeping false positives below 3%. Please note that these benchmarks tend to use longer texts and may not be representative of your text.

Sapling's detector can have false positives. The shorter the text is, the more general it is, and the more essay-like it is, the more likely it is to result in a false positive. We are working on improving the system so that this occurs less frequently.

The free version is currently truncated to 2000 characters (roughly 400 to 500 tokens). Pro and Enterprise Sapling subscribers can paste texts of up to 8000 characters (roughly 1600 to 2000 tokens). For texts longer than that, please break up the text into multiple sections, or consider using our API. If you plan to process more than 5 million characters/month, contact us to see how we can better support your use case.

Yes — you can find the API documentation here.

While language models are evolving and have their differences, they usually use a similar machine learning architecture and a similar dataset on which they're trained. Hence, even AI detectors trained on different and earlier versions of language model outputs should perform significantly better than random on other models.

That said, to get the best performance, detectors should be trained on the outputs of the latest systems. Sapling regularly retrains and finetunes its detector to keep it up-to-date with new systems.

We invite you to collect a small dataset of say a dozen examples (of say blog posts and essays) and try for yourself :-).

We've seen such tools make false claims such as that a text "passed" Sapling.ai's detector even though no check was performed by Sapling.ai. Please be careful when using such tools.