The Rosenblatt perceptron (1958) is an early learning unit: it scales inputs by weights, adds a bias, applies a threshold, and outputs a binary decision (0 or 1).
z = Σ XiWi + b
ŷ = 1 if z ≥ 0, else 0 (step / threshold activation)
When the model misclassifies, weights update proportionally to error:
ΔW = η · (y − ŷ) · X then Wnew = Wold + ΔW
η is the learning rate — how large each corrective step is.
A single perceptron solves only linearly separable problems — data that can be split by one straight line (or hyperplane). AND and OR gates work; XOR does not.
Minsky & Papert (1969) proved a single layer cannot learn XOR. That limitation contributed to the first AI winter and motivated multi-layer networks with non-linear hidden layers.