AI Literacy Chapter 4 4 min read

Lecture 4: AI's Limitations and Risks — Hallucination, Bias, and Privacy

O
OIYO Editorial Contributor
4/5

The Danger of Blindly Trusting AI

AI tools like ChatGPT and Claude demonstrate remarkable capabilities — but they also have critical limitations. The core of AI literacy is understanding both AI’s abilities and its limitations at the same time.

AI Capabilities vs. Limitations — A Realistic Assessment
DomainWhat AI Does WellWhat AI Cannot Do
KnowledgeReproducing patterns from vast training dataCurrent information, expert-verified knowledge
LanguageGenerating fluent, structured textTrue understanding, grasping intent
ReasoningFollowing logical stepsCreative reasoning about genuinely novel concepts
ReliabilityMaintaining consistent styleRecognizing and correcting its own errors

Hallucination

The pattern by which AI fails in the most dangerous way.

Hallucination is when an LLM confidently generates facts that don’t exist. Despite the name, AI isn’t actually confused — it’s simply producing the most plausible next tokens, which may have no basis in reality.

Types of Hallucination and Real Examples
TypeDescriptionReal-World Case
Non-existent citationsCites papers or books that don't existUS lawyer ChatGPT incident — cited fabricated case law and faced court sanctions
Date and number errorsConfidently states incorrect figuresWrong corporate financial figures, faulty statistics
Blended informationIncorrectly merges multiple factsMixing the biographical details of two different people
Missing recent informationUnaware of events after training cutoffMakes incorrect guesses about post-cutoff events
Strategies to prevent acting on hallucinations:
→ Always verify important facts from original sources
→ "Tell me the source" is not enough — AI can fabricate sources too
→ High-stakes domains (legal, medical, financial) require expert review
→ Narrow questions to specific, verifiable claims
→ Treat AI responses as drafts; humans must verify

AI Bias

AI can absorb the biases embedded in its training data — and amplify them.

Types of AI Bias and Real Harms
Type of BiasCauseReal-World Case
Data BiasTraining data over- or under-represents certain groupsMIT study: facial recognition misidentified Black women at a 35% error rate
Historical BiasLearns from past discriminatory patternsAmazon's hiring AI penalized resumes from women (2018)
Confirmation Bias AmplificationRecommends content that reinforces existing beliefsSocial media filter bubbles, radicalization
Linguistic BiasEnglish-dominated training dataLower performance and accuracy for non-English languages
1
Recognize Bias

Consciously check whether AI outputs favor or disadvantage certain groups (by gender, race, age, or nationality).

2
Diversify Outputs

Compare results from multiple AI tools and try prompts from different perspectives.

3
Be Cautious with High-Stakes Decisions

Do not use AI as the sole criterion for important decisions such as hiring, loan approvals, or medical diagnoses.

4
Provide Feedback

Report incorrect AI outputs. Collective feedback contributes to model improvement.


Privacy and AI

Privacy Risks When Using AI
Risk TypeDescriptionPrevention
Inclusion in Training DataInformation you input may be used for future model trainingCheck the service's terms; opt out of data training if possible
Leakage of Sensitive InformationAI may regenerate personal information about othersNever input real names, ID numbers, or account details
Corporate Confidentiality LeaksEntering work information into external AI poses risksUse enterprise AI or minimize what you input
Copyright InfringementAI-generated content may infringe on training data copyrightsCheck the AI service's policy before commercial use

Deepfakes and AI Disinformation

Types of AI-Generated Disinformation
TypeTechnologyHarm Examples
Deepfake VideoFace synthesis via GAN or DiffusionCelebrity impersonation scams, election disinformation
Voice CloningClones a voice from just 5 seconds of audioFake kidnapping phone calls impersonating family members
AI-Generated TextAutomated mass production of fake news and commentsPublic opinion manipulation, fake reviews
Synthetic ImagesPhotos of events that never happenedSpreading false information about wars or disasters

Perfect detection is difficult, but look for: (1) unnatural face edges or hair, (2) out-of-sync blinking and lip movement, (3) asymmetric earrings or glasses, (4) inconsistent backgrounds, (5) verify the originating account. AI detection tools like Deepware and Microsoft Video Authenticator can also help.


Key Takeaways

Hallucination: AI confidently generates non-existent facts → always verify important information from original sources Bias: Training data prejudices are absorbed and amplified by AI → never use AI alone for high-stakes decisions Privacy: Never input sensitive information into external AI → use dedicated enterprise AI for confidential work Deepfakes: AI-generated disinformation is surging → critically verify the source and context of information

O

OIYO Editorial

Content Editor

지식 인큐베이터이자 전문 콘텐츠 크리에이터. 경영, 경제, 법률 및 실생활에 유용한 실무/자격증 중심의 깊이 있는 정보를 연구하고 공유합니다.