NeRF from Scratch

[Figure: NeRF rendering]

A PyTorch implementation of Neural Radiance Fields (NeRF) built from scratch. This implementation focuses on learning and understanding the core concepts of NeRF while maintaining good performance and code readability.

Overview

Neural Radiance Fields (NeRF) represent a scene as a continuous 5D function that maps a spatial location (x, y, z) and viewing direction (θ, φ) to the volume density at that point and the radiance emitted in that direction. This implementation includes:

  • Custom NeRF model with positional encoding
  • Volume rendering pipeline
  • Training on synthetic datasets
  • Inference with novel view synthesis
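
In other words, the network is a learned mapping F: (x, y, z, θ, φ) → (r, g, b, σ). A minimal sketch of that interface, with illustrative names (the actual model lives in source/model.py):

import torch

def radiance_field(model, xyz, view_dirs):
    # xyz: (N, 3) sample positions; view_dirs: (N, 3) unit viewing directions.
    # Returns a view-dependent color and a direction-independent density per point.
    rgb, sigma = model(xyz, view_dirs)  # rgb: (N, 3), sigma: (N, 1)
    return rgb, sigma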

Requirements

torch>=1.8.0
numpy>=1.19.2
Pillow>=8.0.0
tqdm>=4.50.0
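
The dependencies can be installed with pip, for example:

pip install "torch>=1.8.0" "numpy>=1.19.2" "Pillow>=8.0.0" "tqdm>=4.50.0"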

Project Structure

NeRF-from-scratch/
├── source/
│   ├── model.py         # NeRF model architecture
│   ├── dataloaders.py   # Data loading and processing
│   ├── train.py         # Training pipeline
│   └── inference.py     # Novel view synthesis
├── nerf_synthetic/      # Dataset directory
└── README.md

Training

The training pipeline includes:

  • Positional encoding for coordinates and view directions (see the sketch after this list)
  • Hierarchical sampling along rays
  • Efficient batch processing with chunked rays
  • Learning rate scheduling
  • Gradient clipping for stability
  • Automatic checkpointing
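
As a hedged sketch of the positional encoding referenced above (the real implementation is in source/model.py and may also concatenate the raw coordinates), each input coordinate is expanded into sin/cos features at exponentially growing frequencies:

import torch

def positional_encoding(x, num_freqs):
    # x: (..., D) raw coordinates -> (..., D * 2 * num_freqs) features.
    # Frequencies 2^0 ... 2^(num_freqs - 1), as in the NeRF paper; this repo
    # uses num_freqs=10 for positions and 4 for view directions (see below).
    feats = []
    for i in range(num_freqs):
        for fn in (torch.sin, torch.cos):
            feats.append(fn((2.0 ** i) * x))
    return torch.cat(feats, dim=-1)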

To train the model:

python source/train.py

Training parameters:

  • Learning rate: 1e-4 with exponential decay (gamma=0.995)
  • Batch size: 1 (multiple rays are still processed per batch)
  • Number of epochs: 200 × 2 (the best checkpoint is trained again for a further 200 epochs)
  • Positional encoding frequencies: 10 (positions), 4 (directions)
  • Network size: 256 hidden units
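
A sketch of how these settings fit together, assuming an Adam optimizer and a placeholder model and loss (not a verbatim excerpt of source/train.py):

import torch

model = torch.nn.Linear(3, 4)  # stand-in for the NeRF MLP
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.995)

for epoch in range(200):
    loss = model(torch.randn(1024, 3)).square().mean()  # placeholder loss
    optimizer.zero_grad()
    loss.backward()
    # Clip gradients for stability (max_norm=0.1, see the tips below)
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=0.1)
    optimizer.step()
    scheduler.step()  # exponential decay of the learning rate each epoch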

Inference

For rendering novel views:

python source/inference.py

The inference script:

  • Loads trained model weights
  • Generates novel camera trajectories (one common pose construction is sketched below)
  • Renders images from new viewpoints
  • Creates a GIF of the rendered views
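
The pose construction referenced above can be done the way the original NeRF code builds its orbit: place the camera on a sphere around the object and compose elevation and azimuth rotations. A hedged sketch (helper name, radius, and frame count are illustrative):

import numpy as np

def pose_spherical(theta_deg, phi_deg, radius):
    # Camera-to-world matrix on a sphere around the origin, including the final
    # axis flip that matches the Blender convention of the synthetic scenes.
    t = np.eye(4)
    t[2, 3] = radius  # translate the camera out along z
    p = np.deg2rad(phi_deg)
    rot_phi = np.array([[1, 0, 0, 0],
                        [0, np.cos(p), -np.sin(p), 0],
                        [0, np.sin(p), np.cos(p), 0],
                        [0, 0, 0, 1]])
    th = np.deg2rad(theta_deg)
    rot_theta = np.array([[np.cos(th), 0, -np.sin(th), 0],
                          [0, 1, 0, 0],
                          [np.sin(th), 0, np.cos(th), 0],
                          [0, 0, 0, 1]])
    flip = np.array([[-1, 0, 0, 0], [0, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 1.0]])
    return flip @ rot_theta @ rot_phi @ t

# 120 poses orbiting the scene at a fixed -30 degree elevation:
poses = [pose_spherical(a, -30.0, 4.0) for a in np.linspace(0, 360, 120, endpoint=False)]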

Dataset

This implementation uses the synthetic NeRF dataset (specifically the 'chair' scene). The dataset should be organized as:

nerf_synthetic/
└── chair/
    ├── train/
    │   └── *.png
    ├── test/
    │   └── *.png
    ├── transforms_train.json
    └── transforms_test.json
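
Each transforms_*.json file follows the standard Blender-exported NeRF format: a shared horizontal field of view plus one camera-to-world matrix per frame. Abridged example (matrix rows elided):

{
  "camera_angle_x": 0.6911,
  "frames": [
    {
      "file_path": "./train/r_0",
      "transform_matrix": [[...], [...], [...], [0.0, 0.0, 0.0, 1.0]]
    }
  ]
}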

Implementation Details

Model Architecture

  • MLP-based architecture with skip connections
  • Separate branches for density and color prediction
  • Positional encoding for better high-frequency detail
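
A condensed sketch of this kind of architecture (layer counts and names are illustrative; the real network is in source/model.py). Input sizes assume the encodings above: 3 × 2 × 10 = 60 position features and 3 × 2 × 4 = 24 direction features:

import torch
import torch.nn as nn

class TinyNeRF(nn.Module):
    def __init__(self, pos_dim=60, dir_dim=24, hidden=256):
        super().__init__()
        self.stem = nn.Sequential(nn.Linear(pos_dim, hidden), nn.ReLU(),
                                  nn.Linear(hidden, hidden), nn.ReLU())
        # Skip connection: re-inject the encoded position mid-network
        self.post_skip = nn.Sequential(nn.Linear(hidden + pos_dim, hidden), nn.ReLU())
        self.sigma_head = nn.Linear(hidden, 1)   # density branch (position only)
        self.feature = nn.Linear(hidden, hidden)
        self.rgb_head = nn.Sequential(nn.Linear(hidden + dir_dim, hidden // 2), nn.ReLU(),
                                      nn.Linear(hidden // 2, 3), nn.Sigmoid())  # color branch

    def forward(self, pos_enc, dir_enc):
        h = self.stem(pos_enc)
        h = self.post_skip(torch.cat([h, pos_enc], dim=-1))
        sigma = torch.relu(self.sigma_head(h))   # keep density non-negative
        rgb = self.rgb_head(torch.cat([self.feature(h), dir_enc], dim=-1))
        return rgb, sigma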

Volume Rendering

  • Ray sampling with stratified sampling
  • Efficient chunk-based processing
  • Alpha compositing for final color computation
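
Both steps can be sketched in a few lines (illustrative names; the real pipeline also handles chunking and batching). Stratified sampling jitters one sample per evenly spaced bin along each ray; alpha compositing then accumulates the sampled colors by transmittance:

import torch

def stratified_samples(near, far, n_rays, n_samples):
    # One uniformly jittered depth sample per bin: returns (n_rays, n_samples)
    bins = torch.linspace(near, far, n_samples + 1)
    lower, upper = bins[:-1], bins[1:]
    return lower + (upper - lower) * torch.rand(n_rays, n_samples)

def composite(rgb, sigma, t_vals):
    # rgb: (rays, samples, 3); sigma, t_vals: (rays, samples)
    delta = t_vals[..., 1:] - t_vals[..., :-1]
    delta = torch.cat([delta, torch.full_like(delta[..., :1], 1e10)], dim=-1)
    alpha = 1.0 - torch.exp(-sigma * delta)             # per-sample opacity
    trans = torch.cumprod(1.0 - alpha + 1e-10, dim=-1)  # accumulated transmittance
    trans = torch.cat([torch.ones_like(trans[..., :1]), trans[..., :-1]], dim=-1)
    weights = alpha * trans
    return (weights[..., None] * rgb).sum(dim=-2)       # (rays, 3) pixel colors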

Training Features

  • Memory-efficient batch processing
  • Automatic mixed precision for faster training
  • Gradient clipping for training stability
  • Regular checkpointing for experiment tracking
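
Mixed precision here means PyTorch's torch.cuda.amp; a minimal training-step sketch with a stand-in model (assumes a CUDA device, and unscales gradients before clipping so the clip threshold is meaningful):

import torch
from torch.cuda.amp import GradScaler, autocast

model = torch.nn.Linear(3, 3).cuda()  # stand-in model
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
scaler = GradScaler()

x, target = torch.randn(8, 3).cuda(), torch.randn(8, 3).cuda()
with autocast():  # run the forward pass in mixed precision
    loss = torch.nn.functional.mse_loss(model(x), target)
scaler.scale(loss).backward()
scaler.unscale_(optimizer)  # clip real (unscaled) gradients
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=0.1)
scaler.step(optimizer)
scaler.update()
optimizer.zero_grad()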

Technical Challenges & Learning Experience

Key Challenges Overcome:

  1. View Direction Handling:

    • Initially struggled with proper view direction incorporation
    • Solved by normalizing ray directions and ensuring correct tensor dimensions
    • Learned importance of consistent coordinate systems
  2. Training Stability:

    • Early implementations had convergence issues
    • Improved through:
      • Better learning rate scheduling (from constant to exponential decay)
      • Gradient clipping to prevent exploding gradients
      • Proper batch size and chunk size tuning
  3. Memory Management:

    • Initial implementation ran out of GPU memory
    • Solutions implemented:
      • Chunked ray processing
      • Efficient tensor operations
      • Better memory cleanup during training
  4. Volume Rendering (AI-generated):

    • Complex implementation of differentiable volume rendering
    • Learned about:
      • Alpha compositing
      • Importance of proper density scaling
      • Efficient sampling strategies

Key Learnings:

  1. Neural Rendering Concepts:

    • Deep understanding of volumetric rendering
    • Importance of positional encoding for high-frequency details
    • Role of view directions in capturing specular effects
  2. Implementation Skills:

    • Efficient PyTorch tensor operations
    • GPU memory optimization techniques
    • Importance of proper data preprocessing
    • Debugging complex neural architectures
  3. Best Practices:

    • Importance of modular code structure
    • Value of comprehensive logging
    • Need for regular checkpointing
    • Benefits of clean tensor management

Tips for Best Results

  1. Memory Management:

    • Adjust chunk size based on your GPU memory (see the chunking sketch after this list)
    • Clear unused tensors during training
    • Use gradient checkpointing for larger scenes
  2. Training Stability:

    • Start with a lower learning rate (1e-4)
    • Use gradient clipping (max_norm=0.1)
    • Monitor the running average loss
  3. Quality Improvements:

    • Increase sampling points for better quality
    • Adjust near/far bounds based on scene
    • Fine-tune positional encoding frequencies
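
The chunking referenced in the first tip amounts to slicing the ray batch so only a bounded number of rays is resident on the GPU at once (hypothetical helper name):

import torch

def apply_in_chunks(fn, rays, chunk_size=4096):
    # Run fn over chunk_size rays at a time, then stitch the outputs together;
    # shrink chunk_size if you hit out-of-memory errors.
    outs = [fn(rays[i:i + chunk_size]) for i in range(0, rays.shape[0], chunk_size)]
    return torch.cat(outs, dim=0)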

Hardware Requirements & Training Time

This implementation has been tested on:

  • NVIDIA T4 GPU
  • Training time: ~3 hours for 200 epochs on the chair dataset (~5-6 hours for the full 400 epochs)
  • Inference time: ~2-3 seconds per image when rendering a complete 360° view

Note: Performance can vary based on:

  • Scene complexity
  • Number of rays per batch
  • Sampling points per ray

Acknowledgments

This implementation is based on the original NeRF paper: Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, and Ren Ng, "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis," ECCV 2020.
