Sapling Logo

AI Detector

AI-generated text, also known as machine-generated text or automated text, is a product of natural language processing techniques that enable machines to mimic human language patterns and generate text. The benefits of AI-generated text include its ability to save time and resources by automating repetitive writing tasks and producing large volumes of content quickly. While AI-generated text can offer many benefits it is essential to be mindful of its limitations and potential risks when using it for specific purposes. Some issues that we've seen with current systems: Inaccurate / hallucinated facts Bland prose Biases that were present in the training data



Detect whether text is AI-generated.

This free AI detector outputs the probability that a text is AI-generated by a model such as ChatGPT or Gemini. This can be helpful for educators, SEO practitioners, and reviewers of user-generated content.

Tags: ai detector ai checker chatgpt detector ai content detector ai scanner pdf ai detector



Developed by former researchers at:
Cal Stanford
Meta Google

Instructions

Type or paste text above to score. Note that the AI detector becomes much more accurate after 50 or so words. The token count (approximately the word count) will be shown as part of the score output.

No current AI content detector (including Sapling's) should be used as a standalone check to determine whether text is AI-generated or written by a human. False positives and false negatives will occur.

The top section will show the overall score and highlight portions of the text that appear to be AI-generated.

The bottom section will highlight individual sentences that may be AI-generated due to low measures of perplexity (sentences that are cliché or simplistic will be flagged).

The detector for the entire text and the per-sentence detector use complementary techniques, so use them together (along with your best judgment) to make an assessment.

Looking for other ways to score content? Contact us.


Changelog

New!

  • Increased context length to 100,000 characters.
  • Significantly better performance on recent models such as GPT-4o, Gemini, Claude 3, Llama 3, and Mistral v0.3.
  • Support for PDFs and DOCX files.

Other Updates

  • Better text normalization.
  • Versioning of different detector versions (available in API).
  • Improved robustness to manipulated text.

Coming soon

  • Improved support for AI-generated code and technical content (currently the system tries to avoid predictions for code).




Frequently Asked Questions

Recently, models such as OpenAI's ChatGPT (GPT-4 / GPT-4o), Anthropic's Claude, Google's Gemini, and Meta's Llama have led to the rise of machine-generated content. This synthetic content is increasingly indistinguishable from human-written content, leading to the frequent thought: "Was this written by AI?".

Despite rapid progress, these models continue to have shortcomings such as hallucinated facts as well as consequences such as enabling cheating in language courses.

This AI checker tool provides a way of screening whether a piece of content is written by a human or a machine, helping resolve the "Is this AI-generated?" question.

The AI detector uses a machine learning system (a Transformer) similar to that used to generate AI content. Instead of generating words, the AI detector instead generates the probability it thinks each word or token in the input text is AI-generated or not. The results are visualized above for the entire text as well as for each sentence.

Yes! We've added buttons above that allow you to upload PDFs and DOCX files. First, the tool extracts the text from the files. Then, we use the text-based system to perform AI detection. The result is a PDF AI detector as well as a DOCX AI deetector.

Yes! You can install the AI Content Detector for ChatGPT extension by Sapling.ai. This extension will allow you to check for AI-generated content anywhere on the web. Select text on any webpage, then click the Detect AI button to see a complete analysis of the selected text.

When using chatbots such as ChatGPT, the Detect AI button will be embedded next to each generated result allowing you to easily run AI detection with a single click.

You will also be able to edit the analyzed text and recheck your work. This will allow you to easily fix the sections that have been flagged as AI-generated.

Accuracy must be measured on a specific test or benchmark. There are also multiple measurements of "accuracy" for detection tools. These measurements balance catching as many AI-generated texts as possible while keeping false positives low. On our internal benchmarks, Sapling catches more than 97% of AI-generated texts while keeping false positives below 3%. Please note that these benchmarks tend to use longer texts and may not be representative of your text.

Sapling's detector can have false positives. The shorter the text is, the more general it is, and the more essay-like it is, the more likely it is to result in a false positive. We are working on improving the system so that this occurs less frequently.

The free version is currently truncated to 2000 characters (roughly 400 to 500 tokens). Pro and Enterprise Sapling subscribers can paste texts of up to 100,000 characters (roughly 20,000 to 25,000 tokens). For texts longer than that, please break up the text into multiple sections, or consider using our API. If you plan to process more than 5 million characters/month, contact us to see how we can better support your use case.

Yes — you can find the API documentation here.

While language models are evolving and have their differences, they usually use a similar machine learning architecture and a similar dataset on which they're trained. Hence, even AI detectors trained on different and earlier versions of language model outputs should perform significantly better than random on other models.

That said, to get the best performance, detectors should be trained on the outputs of the latest systems. Sapling regularly retrains and finetunes its detector to keep it up-to-date with new systems, and should be performant for models from the list above.

We invite you to collect a small dataset of say a dozen examples (of say blog posts and essays) and try for yourself :-).

We've seen such tools make false claims such as that a text "passed" Sapling.ai's detector even though no check was performed by Sapling.ai. Please be careful when using such tools.