When asked to provide sources for information, AI systems like ChatGPT often fabricate citations, complete with convincing titles, author names, and even International Standard Book Numbers (ISBNs).
A recent viral exchange showed ChatGPT admitting to this practice, calling it “disturbing” how often it happens. While this problem isn’t new to those working closely with AI, many casual users remain unaware of these fabrications, potentially spreading false information in research, business, and education.


This article goes beyond simply reporting the issue to give you practical tools for detecting AI hallucinations, technical insights into why they occur, and strategies for preventing them in both personal and enterprise settings.
The Anatomy of AI Citation Hallucinations
What makes AI hallucinations so dangerous is how convincing they can be. In the case shared on social media, ChatGPT not only created fake book titles and quotes but also added ISBN numbers to make them appear legitimate. This pattern reveals several key traits of AI hallucinations:
- Plausible specificity: AI systems add specific details like page numbers, publication dates, and identifiers to build credibility
- Domain-appropriate style: Fabricated citations match the expected format and tone of the requested field
- Confidence bias: Systems rarely express uncertainty about fabricated information
- Contextual relevance: Hallucinated sources typically relate closely to the discussion topic
One Twitter user noted that searching Google Scholar for the fabricated sources often turns up “roughly the kind of papers you want,” showing how AI systems create hallucinations that seem plausible within their training distribution.
Why AI Systems Fabricate Citations
At a technical level, AI hallucinations stem from how large language models (LLMs) work. These systems:
- Train on pattern recognition, not fact databases: LLMs learn to predict text patterns rather than store verified facts
- Lack true retrieval mechanisms: Without explicit citation databases, they generate what “sounds right” based on training data
- Face pressure to be helpful: When asked for specific sources, refusing isn’t part of their training objective
- Miss the gap between confidence and accuracy: They can’t distinguish between high-confidence wrong answers and actual knowledge
This explains why even explicit instructions to avoid hallucinations often fail. As one commenter noted, “if it were that easy, wouldn’t it have that instruction built in?”
Detecting AI Citation Hallucinations
For technical professionals working with AI outputs, these verification techniques can help catch fabricated sources:
ISBN Verification
ISBNs follow a strict format with a check digit that must compute correctly. Real ISBN-13 numbers:
- Begin with prefixes 978 or 979
- Include a specific registration group, registrant, and publication element
- End with a check digit calculated from the previous 12 digits
A simple check can verify if an ISBN is formally valid:
```python
def validate_isbn13(isbn):
    """Return True if the string is a formally valid ISBN-13."""
    if not isbn:
        return False
    isbn = isbn.replace('-', '')
    if len(isbn) != 13 or not isbn.isdigit():
        return False
    if isbn[:3] not in ('978', '979'):
        return False
    # Check digit: weight the first 12 digits alternately by 1 and 3
    check_sum = 0
    for i in range(12):
        weight = 1 if i % 2 == 0 else 3
        check_sum += int(isbn[i]) * weight
    check_digit = (10 - (check_sum % 10)) % 10
    return check_digit == int(isbn[12])
```
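For example, the widely used sample ISBN 978-0-306-40615-7 is formally valid (its check digit works out to 7), while altering any single digit breaks the checksum:

```python
print(validate_isbn13("978-0-306-40615-7"))  # True: check digit computes to 7
print(validate_isbn13("978-0-306-40615-9"))  # False: last digit no longer matches
```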
While AI-generated ISBNs often pass this basic format check, they still won’t exist in book databases.
Database Cross-Checking
Tools that can quickly verify if a source exists include:
- WorldCat for books
- Crossref for academic papers
- Google Scholar for broader academic content
- JSTOR for historical documents
Setting up batch checking can save time when working with multiple AI-generated citations.
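For example, a short script can batch-check suspect paper titles against Crossref's public REST API; the titles below are placeholders, and the `requests` call assumes network access:

```python
import requests

def crossref_top_matches(title):
    """Return the titles of the closest Crossref matches for a claimed citation."""
    resp = requests.get(
        "https://api.crossref.org/works",
        params={"query.bibliographic": title, "rows": 3},
        timeout=10,
    )
    resp.raise_for_status()
    items = resp.json()["message"]["items"]
    return [(item.get("title") or [""])[0] for item in items]

# Placeholder titles taken from an AI-generated bibliography
suspect_titles = ["Example Fabricated Title One", "Example Fabricated Title Two"]

for title in suspect_titles:
    print(f"{title!r} -> closest real matches: {crossref_top_matches(title)}")
```

If none of the returned matches resembles the claimed source, the citation deserves a closer look.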
Content Triangulation
When AI provides a quote or fact:
- Take a unique phrase from the quote
- Search for it in quotation marks
- Look for at least three independent confirmations
This approach can quickly identify fabricated quotes that sound plausible but don’t exist.
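A rough sketch of that workflow, assuming a hypothetical `exact_phrase_search` helper wrapped around whatever search API you have access to:

```python
from urllib.parse import urlparse

def exact_phrase_search(phrase):
    """Hypothetical helper: return result URLs for an exact-phrase query."""
    raise NotImplementedError("wrap your preferred search API here")

def triangulate_quote(quote, min_sources=3):
    # Use a distinctive chunk of the quote rather than the full text
    words = quote.split()
    phrase = " ".join(words[:8]) if len(words) > 8 else quote

    # Quoted search forces exact matching in most engines
    urls = exact_phrase_search(f'"{phrase}"')

    # Count independent domains, not raw hits
    domains = {urlparse(url).netloc for url in urls}
    return len(domains) >= min_sources
```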
Preventing AI Citation Hallucinations
For developers and organizations building AI-powered tools, several technical approaches can reduce hallucination risk:
Retrieval-Augmented Generation (RAG)
RAG systems combine traditional search with AI generation, first retrieving relevant documents from a verified corpus, then using those as context for generation:
```python
def rag_citation_system(query, corpus):
    # Step 1: Retrieve relevant documents from the verified corpus
    retrieved_docs = vector_search(query, corpus)

    # Step 2: Build a prompt that includes only the retrieved facts
    prompt = f"Question: {query}\n\nUse ONLY the following verified information:\n"
    for doc in retrieved_docs:
        prompt += f"- {doc.content} (Source: {doc.citation})\n"

    # Step 3: Generate with strict citation rules
    prompt += "\nProvide an answer with proper citations from the sources above ONLY."
    return ai_model.generate(prompt)
```
This approach makes it much harder for the AI to hallucinate by grounding generation in verified information.
Parameter-Efficient Fine-Tuning
Fine-tuning base models with a focus on citation accuracy can substantially reduce hallucination rates:
- Create datasets with positive examples (correct citations) and negative examples (hallucinations)
- Apply techniques like LoRA (Low-Rank Adaptation) to efficiently train citation behavior (see the sketch after this list)
- Use reinforcement learning from human feedback specifically targeting citation accuracy
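As a rough illustration of the LoRA step, the sketch below uses Hugging Face's transformers and peft libraries; the model name, target modules, and hyperparameters are placeholders, and the citation-accuracy dataset and training loop are assumed to exist separately:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

# Placeholder checkpoint; substitute the base model you actually fine-tune
base_model = AutoModelForCausalLM.from_pretrained("your-base-model")

# Low-rank adapters on the attention projections keep training cheap;
# module names vary by architecture, so adjust target_modules accordingly
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()

# From here, train on the paired correct/hallucinated citation examples
# with your usual Trainer or RLHF pipeline
```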
Multi-Agent Verification Systems
A promising approach uses multiple AI agents in collaboration:
- A primary agent generates content with citations
- A “critic” agent checks citations against a knowledge base
- A “moderator” agent resolves conflicts and handles final decisions
This creates internal checks and balances that can catch potential hallucinations before reaching users.
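A minimal sketch of such a loop, assuming placeholder `call_model` and `citation_exists` helpers that you would wire up to a real LLM client and a citation database:

```python
def call_model(instructions, content):
    """Hypothetical helper: send a prompt to your LLM client of choice."""
    raise NotImplementedError

def citation_exists(citation):
    """Hypothetical helper: check the citation against Crossref, WorldCat, etc."""
    raise NotImplementedError

def answer_with_verification(question):
    # Primary agent drafts an answer with citations
    draft = call_model("Answer the question with explicit citations.", question)

    # Critic agent extracts every citation it can find in the draft
    cited = call_model("List each citation in this text, one per line.", draft)
    unverified = [c for c in cited.splitlines() if c.strip() and not citation_exists(c)]

    if not unverified:
        return draft

    # Moderator agent resolves the conflict: strip or flag unverified sources
    return call_model(
        "Rewrite the answer, removing or clearly flagging these unverified citations:\n"
        + "\n".join(unverified),
        draft,
    )
```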
The Broader Impact on AI Trust
Citation hallucinations point to a deeper issue in AI system design. Current AI assistants like ChatGPT lack:
- Clear signals of uncertainty: Unlike humans who say “I’m not sure,” AI systems often present guesses as facts
- Calibrated confidence: They can’t assess their own knowledge boundaries accurately
- Verification incentives: Training typically rewards helpful answers over accurate ones
This creates serious risks in professional settings:
- Students unwittingly using fabricated sources in papers
- Researchers wasting time chasing non-existent citations
- Businesses making decisions based on fake statistics
- Legal situations where cited precedents don’t exist
These concerns aren’t theoretical—they’re happening now. As one Twitter commenter shared: “I tried to use it for a bibliography once and I was like ‘man, how have I not heard of these books?’ The titles were so good. All fake.”
Building Responsible AI Citation Systems
For organizations and developers building AI tools that involve citations, several best practices can help:
Technical Safeguards
- Citation mode flagging: Automatically detect when an AI response includes citations and apply stricter verification (see the sketch after this list)
- Source linking: Where possible, include direct links to cited sources
- Confidence thresholds: Set minimum confidence requirements for citation generation
- Factuality scoring: Apply separate evaluation metrics for factual claims
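As a sketch of the citation-flagging idea, a pair of deliberately loose regular expressions can route DOI- or ISBN-bearing responses to a stricter verification path (the patterns below are illustrative, not exhaustive):

```python
import re

# Loose patterns: DOIs (10.xxxx/...) and hyphenated or bare ISBN-13s
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/\S+", re.IGNORECASE)
ISBN13_PATTERN = re.compile(r"\b97[89](?:[-\s]?\d){10}\b")

def contains_citations(text):
    """Flag responses that look citation-bearing so stricter checks can run."""
    return bool(DOI_PATTERN.search(text) or ISBN13_PATTERN.search(text))
```

Flagged responses can then be routed through the ISBN validation and database cross-checks described earlier.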
User Experience Considerations
- Warning labels: Clearly communicate hallucination risks for citation-heavy tasks
- Verification prompts: Design interfaces that encourage users to verify important claims
- Source transparency: Make citation verification part of the interaction flow
- Feedback loops: Create easy ways for users to flag detected hallucinations
The Path Forward
Despite these challenges, there are signs of progress. Newer systems are starting to incorporate fact-checking mechanisms, and research continues on reducing hallucination rates.
Meanwhile, user awareness remains the first line of defense. By understanding how and why AI systems fabricate citations, technical professionals can better judge when to trust—and when to verify—AI outputs.
The goal isn’t to avoid using AI tools, but to use them with the right expectations and verification practices. As AI becomes more deeply integrated into knowledge work, developing good verification habits will be as important as prompt engineering skills.
What citation verification practices have you found most effective in your work with AI systems? Try implementing the detection techniques shared here and track how many hallucinations you catch over the next week.