AI Literacy Chapter 3 4 min read

Lecture 3: Deep Learning and Neural Networks — How AI Perceives the World

O
OIYO Editorial Contributor
3/5

Deep Learning: Inspired by the Brain

Artificial neural networks (ANNs) draw inspiration from the structure of neurons in the brain — but how they actually work is quite different from biological brains.

Biological Neuron vs. Artificial Neuron
PropertyBiological NeuronArtificial Neuron
InputReceives chemical signals via dendritesSum of numbers (weight × input value)
ProcessingFires an electrical signal when threshold is exceededPasses through an activation function (ReLU, Sigmoid)
OutputTransmitted to the next neuron via the axonPasses a number to the next layer
LearningSynaptic strength changes (Hebbian learning)Weights adjusted via backpropagation

How Neural Networks Work

1
Forward Pass

Input data is transformed as it passes through multiple layers. At each layer, it is multiplied by weights and passed through an activation function to produce the final output.

2
Loss Calculation

The difference between the model's output and the correct answer is quantified. The smaller the loss, the more accurate the model.

3
Backpropagation

To reduce the loss, the error is propagated backward from the output layer to the input layer. This calculates how much each weight contributed to the loss.

4
Gradient Descent

Weights are adjusted slightly in the direction opposite to the computed gradient. After millions of iterations, the model becomes progressively more accurate.

An intuitive analogy:
→ Descending a mountain blindfolded (gradient descent)
→ Feeling the slope underfoot (gradient = slope)
→ Taking one step downhill at a time (weight update)
→ Eventually reaching the lowest point (minimum loss)

Challenges:
→ Global minimum vs. local minimum
→ Learning rate: too large → oscillates, too small → slow

Revolutionary Deep Learning Architectures

Comparison of Major Deep Learning Architectures
ArchitectureSpecializationCore IdeaKey Applications
CNN (Convolutional Neural Network)Images & videoExtracts spatial patterns using filtersFace recognition, medical imaging, self-driving cars
RNN / LSTMSequential dataRemembers prior information (gate mechanism)Translation, speech recognition (pre-2017)
TransformerText & general purposeAttention processes entire context in parallelGPT, BERT, ChatGPT, Claude
GAN (Generative Adversarial Network)Image & audio generationGenerator vs. discriminator competingImage generation, deepfakes
Diffusion ModelImage generationGenerates images by reversing a noise processDALL·E, Stable Diffusion

CNN — The Eyes of AI

CNN's Hierarchical Feature Extraction
LayerFeatures ExtractedAnalogy
Early Layers (Low-level)Edges, lines, color changesBasic shapes like dots, lines, circles
Middle Layers (Mid-level)Textures, patterns, corner combinationsBody parts like eyes, nose, ears
Later Layers (High-level)Full faces, cars, animalsRecognizing cats, dogs, people

In 2012, AlexNet halved the error rate in the ImageNet classification competition compared to prior methods. That moment marked the beginning of the deep learning revolution. Since then, AI’s visual capabilities have surpassed human-level performance.


Transformer — The Engine of Modern AI

Building on the introduction in Lecture 1, let’s understand Transformers more deeply.

RNN vs. Transformer
PropertyRNN/LSTMTransformer
Processing MethodSequential (one step at a time)All at once (parallel)
Long-Range DependenciesDistant information fadesAttention connects any position directly
Training SpeedCannot parallelize → slowParallelizable → fast
Model SizeMillions of parametersTens to hundreds of billions of parameters
Representative ModelsLSTM translation modelsGPT-4, Claude, Gemini
The Transformer's core innovation — Attention:
Sentence: "The animal didn't cross the street because it was too tired."

RNN: "it" → information passed sequentially from earlier → distant "animal" fades
Transformer: "it" → attends to every word simultaneously → connects directly to "animal"

→ Correctly identifies "it = animal"
→ This is how LLMs understand complex context

Limitations of Deep Learning

Key Limitations of Deep Learning
LimitationExplanationReal-World Problem
Black BoxCannot explain why it reached a conclusionTrust issues with medical and legal AI
Data DependencyRequires large volumes of high-quality dataInsufficient AI for diagnosing rare diseases
Distribution ShiftFails when test conditions differ from trainingSelf-driving cars encountering unfamiliar roads
Compute CostTraining requires enormous energy and expenseEnvironmental impact, accessibility inequality

Key Takeaways

Backpropagation: propagate error backward → adjust weights → repeat CNN: hierarchical feature extraction — edges → parts → whole objects Transformer: sequential processing → parallel Attention → speed and performance revolution Deep learning limitations: black box + data dependency + compute cost

O

OIYO Editorial

Content Editor

지식 인큐베이터이자 전문 콘텐츠 크리에이터. 경영, 경제, 법률 및 실생활에 유용한 실무/자격증 중심의 깊이 있는 정보를 연구하고 공유합니다.