The Neuron

NN from Scratch

A neural network is just a function composed of many simple functions stacked together. Strip away the buzzwords: inputs → weighted sums → activation → repeat → output.

1. A Single Neuron

Each neuron computes one tiny formula:

output = activation( w₁x₁ + wā‚‚xā‚‚ + … + wā‚™xā‚™ + b )

Think of it as a yes/no/maybe detector. It takes inputs, weighs each one by importance (weights), adds a nudge (bias), then decides whether to fire through an activation function.

Each connection has a number: big weight → important input. Near-zero weight → ignored. Negative weight → suppresses the input.

[Interactive demo: drag the sliders to change the inputs and watch the neuron respond. Shown configuration: inputs x₁ = 0.50, xā‚‚ = -0.30, xā‚ƒ = 0.70; weights w₁ = 0.50, wā‚‚ = -0.30, wā‚ƒ = 0.80; ReLU activation.]
Step-by-step computation:
z = (0.50 Ɨ 0.50) + (-0.30 Ɨ -0.30) + (0.70 Ɨ 0.80) + 0.100
z = 0.250 + 0.090 + 0.560 + 0.100 = 1.000
Å· = ReLU(1.000) = 1.000
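To make this concrete, here's a minimal NumPy sketch of the same computation, with the numbers taken from the demo above (the variable names are just for illustration):

```python
import numpy as np

x = np.array([0.50, -0.30, 0.70])   # inputs from the demo
w = np.array([0.50, -0.30, 0.80])   # one weight per input
b = 0.10                            # bias

z = np.dot(w, x) + b                # weighted sum + bias
y = max(0.0, z)                     # ReLU: pass positives, zero out negatives
print(f"z = {z:.3f}, Å· = {y:.3f}")  # z = 1.000, Å· = 1.000
```

Swap max(0.0, z) for a different activation (sigmoid, tanh) and the rest stays identical; the weighted sum is the neuron.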

2. A Layer of Neurons

A layer is just many neurons running in parallel. Every neuron looks at the same inputs but with different weights — so each learns to detect a different pattern.

In a weight matrix, each row = one neuron's weights. Computing a layer's output is just doing the dot product for every neuron at once.

Same inputs, different weights per neuron — each neuron detects a different pattern.

[Interactive demo: inputs x₁ = 1.00, xā‚‚ = 2.00, xā‚ƒ = 3.00, xā‚„ = 2.50 flow through three neurons (weighted sum + bias, then ReLU), producing y₁ = 4.800, yā‚‚ = 1.210, yā‚ƒ = 2.385.]
Weight Matrix — each row is one neuron's weights

        wĀ·1     wĀ·2     wĀ·3     wĀ·4    bias   output
N₁     0.20    0.80   -0.50    1.00    2.00    4.800
Nā‚‚     0.50   -0.91    0.26   -0.50    3.00    1.210
Nā‚ƒ    -0.26   -0.27    0.17    0.87    0.50    2.385
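In code, the whole layer collapses into one matrix-vector product: stack the weight rows into W, the biases into b, and compute ReLU(W @ x + b). A minimal NumPy sketch using the numbers from the table above:

```python
import numpy as np

x = np.array([1.00, 2.00, 3.00, 2.50])        # the four inputs

W = np.array([[ 0.20,  0.80, -0.50,  1.00],   # N1's weights
              [ 0.50, -0.91,  0.26, -0.50],   # N2's weights
              [-0.26, -0.27,  0.17,  0.87]])  # N3's weights
b = np.array([2.00, 3.00, 0.50])              # one bias per neuron

z = W @ x + b             # one dot product per row, i.e. per neuron
y = np.maximum(0.0, z)    # ReLU, applied elementwise
print(y)                  # ā‰ˆ [4.8, 1.21, 2.385]
```

Note np.maximum (elementwise) rather than Python's built-in max: it zeroes out each negative entry independently.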

3. Batch Input

In practice, we don't feed one sample at a time — we process a batch of samples at once. Each row in the input matrix is one sample, and the layer processes all of them simultaneously.

This is just matrix multiplication: Output = ReLU(X @ W.T + b). The GPU does all rows in parallel — that's why neural networks are fast.

[Interactive demo: click a sample row to select it; the neuron diagram shows that sample flowing through the layer.]

Input Batch (3 samples)

        X1      X2      X3      X4
s1    1.00    2.00    3.00    2.50
s2   -0.80    0.40    0.10    1.20
s3    0.20    0.90   -0.50    0.30

→ Layer (3 neurons) →

Output (3 Ɨ 3)

        y1      y2      y3
s1   4.800   1.210   2.385
s2   3.310   1.662   1.661
s3   3.310   2.001   0.381
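Here's the same layer from section 2, now fed all three samples at once. A minimal NumPy sketch; the transpose on W makes the shapes line up, (3, 4) @ (4, 3) → (3, 3):

```python
import numpy as np

X = np.array([[ 1.00, 2.00,  3.00, 2.50],     # sample s1
              [-0.80, 0.40,  0.10, 1.20],     # sample s2
              [ 0.20, 0.90, -0.50, 0.30]])    # sample s3

W = np.array([[ 0.20,  0.80, -0.50,  1.00],   # same weights as section 2
              [ 0.50, -0.91,  0.26, -0.50],
              [-0.26, -0.27,  0.17,  0.87]])
b = np.array([2.00, 3.00, 0.50])              # broadcast across all rows

out = np.maximum(0.0, X @ W.T + b)            # (3, 4) @ (4, 3) + (3,) -> (3, 3)
print(out.round(3))
# [[4.8    1.21   2.385]
#  [3.31   1.662  1.661]
#  [3.31   2.001  0.381]]
```

Row i of the output is exactly what you'd get by pushing sample i through the layer on its own; the batch form just lets the hardware do every row at once.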

One-line summary

A neural network is a giant stack of weighted sums + nonlinear activations that learns patterns from data.

Single neuron → weighted sum + bias + activation = one tiny detector

Layer → many neurons in parallel = many detectors running together

Batch → many samples at once = matrix multiplication = GPU go brrrr