micrograd C++

A self-contained, lightweight scalar autograd engine and neural network library in C++17, inspired by Andrej Karpathy's micrograd. It is designed for educational clarity: the codebase is split into modular subdirectories, one per abstraction layer, with a convenience facade header, micrograd.hpp, at the root.

This edition uses fully unabbreviated variable and parameter names (weights, bias, number_of_inputs, number_of_outputs, number_of_outputs_list, inputs, activation, non_linear) and explicit .forward() calls instead of shorthand symbols or C++ function-call operators (operator()), for readability.

Repository Structure

microgradcpp/
├── autograd/
│   ├── Value.hpp       (Value node and overloaded math operators declarations)
│   └── Value.cpp       (Value backpropagation math and operator implementations)
├── neural_network/
│   ├── NN.hpp          (Neuron, Layer, and MLP declarations)
│   └── NN.cpp          (Weights, bias parameters, and forward neural layouts)
├── training/
│   ├── training_loop.hpp (MLP gradient descent optimization trainer declaration)
│   └── training_loop.cpp (Training optimization implementation with loss epoch logging)
├── test/
│   └── test_micrograd.cpp (Engine unit tests checking gradients against PyTorch outputs)
├── micrograd.hpp       (Convenience facade umbrella header for root inclusions)
├── demo.cpp            (A simple Multi-Layer Perceptron training demo)
├── Makefile            (Direct compiler rules using clang++)
└── README.md           (This guide)

Detailed Developer Guide & Usage

This library is split into three main modules: the Autograd engine, the Neural Network layers, and the Training loops. Below are complete instructions and examples for using each of them.

1. Autograd Math Engine (`/autograd`)

The scalar autograd engine builds a computational DAG of Value nodes (Val is an alias for std::shared_ptr<Value>). Nodes are linked dynamically through mathematical operators (+, -, *, /) and activation functions.

To compute the gradient of an output with respect to every node that feeds into it, build your expression and call .backward_pass() on the final output node:

#include "micrograd.hpp"
#include <iostream>

using namespace micrograd;

int main() {
    // 1. Instantiate leaf inputs
    Val x = std::make_shared<Value>(-4.0);
    Val w = std::make_shared<Value>(2.0);
    
    // 2. Perform forward math operations
    Val y = x * w + 5.0;     // y = -4 * 2 + 5 = -3.0
    Val loss = y->relu();    // loss = max(0, -3) = 0.0
    
    // 3. Backpropagate derivatives
    loss->backward_pass();
    
    // 4. Access gradients (both print 0: the ReLU input is -3, so no gradient flows back)
    std::cout << "d(loss)/dx: " << x->grad << std::endl;
    std::cout << "d(loss)/dw: " << w->grad << std::endl;
    return 0;
}
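
To make the mechanics behind .backward_pass() concrete, here is a minimal sketch of a scalar reverse-mode autograd node. It is not the library's implementation: SketchValue, SketchVal, leaf, multiply, add, relu, and backward are hypothetical names chosen for illustration, and the real Value class may store its graph differently. Each operation records its operands and a closure that applies the chain rule, and backward visits nodes in reverse topological order.

```cpp
#include <functional>
#include <memory>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for the library's Value node (illustration only).
struct SketchValue {
    double data;
    double grad = 0.0;
    std::vector<std::shared_ptr<SketchValue>> operands;
    std::function<void()> propagate = [] {};  // pushes this node's grad into its operands
    explicit SketchValue(double value) : data(value) {}
};
using SketchVal = std::shared_ptr<SketchValue>;

SketchVal leaf(double value) { return std::make_shared<SketchValue>(value); }

SketchVal multiply(const SketchVal& left, const SketchVal& right) {
    SketchVal out = leaf(left->data * right->data);
    out->operands = {left, right};
    SketchValue* raw = out.get();  // raw pointer avoids a shared_ptr ownership cycle
    out->propagate = [left, right, raw] {
        left->grad += right->data * raw->grad;   // d(l*r)/dl = r
        right->grad += left->data * raw->grad;   // d(l*r)/dr = l
    };
    return out;
}

SketchVal add(const SketchVal& left, const SketchVal& right) {
    SketchVal out = leaf(left->data + right->data);
    out->operands = {left, right};
    SketchValue* raw = out.get();
    out->propagate = [left, right, raw] {
        left->grad += raw->grad;
        right->grad += raw->grad;
    };
    return out;
}

SketchVal relu(const SketchVal& input) {
    SketchVal out = leaf(input->data > 0.0 ? input->data : 0.0);
    out->operands = {input};
    SketchValue* raw = out.get();
    out->propagate = [input, raw] {
        input->grad += (raw->data > 0.0 ? 1.0 : 0.0) * raw->grad;
    };
    return out;
}

// Reverse-mode pass: topologically sort the graph, seed the root, apply the chain rule.
void backward(const SketchVal& root) {
    std::vector<SketchValue*> order;
    std::unordered_set<SketchValue*> visited;
    std::function<void(SketchValue*)> visit = [&](SketchValue* node) {
        if (!visited.insert(node).second) return;
        for (const auto& operand : node->operands) visit(operand.get());
        order.push_back(node);
    };
    visit(root.get());
    root->grad = 1.0;
    for (auto it = order.rbegin(); it != order.rend(); ++it) (*it)->propagate();
}
```

Building relu(add(multiply(x, w), leaf(5.0))) with x = -4 and w = 2 and calling backward on the result leaves both x->grad and w->grad at 0, the same outcome as the library example above. Gradients are accumulated with += so that a node used more than once receives the sum of its contributions.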

2. Neural Network Construction (`/neural_network`)

Define fully connected multi-layer perceptron (MLP) layouts. An MLP consists of nested Layer structures, which in turn contain Neuron nodes.

Neuron: Computes a weighted sum of its inputs plus a bias, then applies an activation: $f(x) = \text{activation}\left(\sum_i w_i x_i + \text{bias}\right)$.
Layer: Represents a standard fully connected layer grouping multiple independent Neurons.
MLP: A chained stack of layers processing inputs feedforward.

#include "micrograd.hpp"
#include <iostream>
#include <vector>

using namespace micrograd;

int main() {
    // Instantiate MLP mapping: 3 inputs -> hidden layer 4 -> hidden layer 4 -> 1 output
    MLP model(3, {4, 4, 1});
    
    // Forward pass
    std::vector<Val> input_sample = {
        std::make_shared<Value>(2.0),
        std::make_shared<Value>(1.5),
        std::make_shared<Value>(-1.0)
    };
    std::vector<Val> prediction = model.forward(input_sample);
    
    std::cout << "Output: " << prediction[0]->data << std::endl;
    return 0;
}
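
The Neuron formula above can be sketched with plain doubles, without any gradient tracking. This is an illustration under assumptions, not the library's code: neuron_forward, layer_forward, and NeuronParameters are hypothetical names, and ReLU is assumed as the activation because it is the only one this guide demonstrates. The non_linear flag switches the activation off, as one would typically do for an output layer.

```cpp
#include <algorithm>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper: weighted sum plus bias, optionally passed through ReLU.
double neuron_forward(const std::vector<double>& weights, double bias,
                      const std::vector<double>& inputs, bool non_linear) {
    if (weights.size() != inputs.size()) {
        throw std::invalid_argument("weights and inputs must have the same length");
    }
    double activation = bias;
    for (std::size_t index = 0; index < weights.size(); ++index) {
        activation += weights[index] * inputs[index];
    }
    return non_linear ? std::max(0.0, activation) : activation;
}

// Hypothetical parameter bundle for one neuron.
struct NeuronParameters {
    std::vector<double> weights;
    double bias;
};

// A layer evaluates every neuron independently on the same inputs.
std::vector<double> layer_forward(const std::vector<NeuronParameters>& neurons,
                                  const std::vector<double>& inputs, bool non_linear) {
    std::vector<double> outputs;
    outputs.reserve(neurons.size());
    for (const auto& neuron : neurons) {
        outputs.push_back(neuron_forward(neuron.weights, neuron.bias, inputs, non_linear));
    }
    return outputs;
}
```

An MLP is then a chain of such layers, where each layer's outputs become the next layer's inputs.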

3. Modular Gradient Descent Training (`/training`)

Use the centralized train(...) API to train an MLP with stochastic gradient descent (SGD) and a step learning-rate rule:

#include "micrograd.hpp"
#include "training/training_loop.hpp"
#include <vector>

using namespace micrograd;

int main() {
    MLP model(3, {4, 4, 1});
    
    // Prepare binary classification dataset (4 samples)
    std::vector<std::vector<Val>> inputs = {
        {std::make_shared<Value>(2.0), std::make_shared<Value>(3.0), std::make_shared<Value>(-1.0)},
        {std::make_shared<Value>(3.0), std::make_shared<Value>(-1.0), std::make_shared<Value>(0.5)},
        {std::make_shared<Value>(0.5), std::make_shared<Value>(1.0), std::make_shared<Value>(1.0)},
        {std::make_shared<Value>(1.0), std::make_shared<Value>(1.0), std::make_shared<Value>(-1.0)}
    };
    std::vector<Val> targets = {
        std::make_shared<Value>(1.0),
        std::make_shared<Value>(-1.0),
        std::make_shared<Value>(-1.0),
        std::make_shared<Value>(1.0)
    };
    
    // Optimize model parameters (weights + bias) over 100 epochs, learning rate = 0.05
    train(model, inputs, targets, 100, 0.05);
    return 0;
}
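
The internals of train(...) are not shown in this guide, so the following is only a sketch of the general shape of a gradient descent epoch: run the forward pass, accumulate the loss and its gradients, then step each parameter against its gradient. It assumes a single linear neuron with a summed squared-error loss and hand-derived gradients. LinearModel, predict, and train_one_epoch are hypothetical names, and the library's loss function may differ.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical model: one linear neuron, no activation.
struct LinearModel {
    std::vector<double> weights;
    double bias;
};

double predict(const LinearModel& model, const std::vector<double>& inputs) {
    double output = model.bias;
    for (std::size_t index = 0; index < model.weights.size(); ++index) {
        output += model.weights[index] * inputs[index];
    }
    return output;
}

// Runs one full-batch epoch and returns the loss measured before the update.
double train_one_epoch(LinearModel& model,
                       const std::vector<std::vector<double>>& inputs,
                       const std::vector<double>& targets,
                       double learning_rate) {
    std::vector<double> weight_gradients(model.weights.size(), 0.0);
    double bias_gradient = 0.0;
    double loss = 0.0;
    for (std::size_t sample = 0; sample < inputs.size(); ++sample) {
        const double error = predict(model, inputs[sample]) - targets[sample];
        loss += error * error;
        // d(error^2)/d(weight_i) = 2 * error * input_i, d(error^2)/d(bias) = 2 * error
        for (std::size_t index = 0; index < model.weights.size(); ++index) {
            weight_gradients[index] += 2.0 * error * inputs[sample][index];
        }
        bias_gradient += 2.0 * error;
    }
    // Gradient descent step: move each parameter against its gradient.
    for (std::size_t index = 0; index < model.weights.size(); ++index) {
        model.weights[index] -= learning_rate * weight_gradients[index];
    }
    model.bias -= learning_rate * bias_gradient;
    return loss;
}
```

In the library, the hand-derived gradients would instead come from calling .backward_pass() on the loss node, and the gradients would need to be reset to zero before each epoch so they do not accumulate across steps.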

How to Build and Run

Using the Makefile (Recommended)

Compile and run targets with the provided Makefile:

# Compile and run the complete autograd and MLP unit test suite
make test

# Compile and run the MLP training demo
make run-demo

# Build both binaries without running them
make build

# Remove compiled binary executables
make clean

Manual Compilation

Alternatively, compile directly with clang++ (or g++):

# Compile and execute the test suite
clang++ -O3 -std=c++17 test/test_micrograd.cpp autograd/Value.cpp neural_network/NN.cpp -I. -o test_micrograd && ./test_micrograd

# Compile and execute the MLP demo showcase
clang++ -O3 -std=c++17 demo.cpp autograd/Value.cpp neural_network/NN.cpp training/training_loop.cpp -I. -o demo && ./demo

Exact Mathematical Correctness

Sanity Assertions

Unit tests compare forward and backward passes against exact analytical values and PyTorch outputs:

y->data = -20.0
x->grad = 46.0
g->data = 24.704081632653
a->grad = 138.833819241983
b->grad = 645.577259475219

MLP Optimization Loss

Training optimizes an MLP(3, {4, 4, 1}) binary classifier to a loss below 0.0003:

Training the MLP (100 epochs)...
  Epoch   1 | Loss: 1.097913
  Epoch  50 | Loss: 0.063801
  Epoch 100 | Loss: 0.000270

Predictions:
  Input: [2.0, 3.0, -1.0] | Prediction Value: +0.9862 | Target Value: 1.0000
  Input: [3.0, -1.0, 0.5] | Prediction Value: -1.0002 | Target Value: -1.0000
  Input: [0.5, 1.0, 1.0] | Prediction Value: -0.9942 | Target Value: -1.0000
  Input: [1.0, 1.0, -1.0] | Prediction Value: +0.9727 | Target Value: 1.0000
