Final Year Project ยท 1st Class Honours

A#X0$M9K@1 Navigation

Deep Reinforcement Learning in Complex Traffic Environments

PPO Algorithm
Unity Simulation
C# Implementation
01

System Overview

An intelligent autonomous vehicle system designed to navigate complex, dynamic traffic environments. Unlike traditional rule-based systems, this project utilises Deep Reinforcement Learning to enable the agent to learn optimal driving policies through trial and error.

Simulation Environment
02

The Environment

Unity Simulation Engine

A custom-built, realistic 3D environment in Unity. It simulates:

  • Dynamic Traffic Lights
  • Complex Intersections
  • Multi-Vehicle Interaction
  • Sensor Ray-casting
03

The Brain

Proximal Policy Optimisation (PPO)

The agent utilises PPO, a state-of-the-art policy gradient method, to master continuous control. It processes discrete inputs (sensors, speed, traffic state) to output continuous steering and acceleration actions.

04

Reward Logic

A modular reward architecture guides the learning process. The agent balances multiple objectives:

+ Velocity Maintenance
+ Lane Adherence
! Collision (Terminal)
! Red Light Violation
... Additional rewards and penalties

Interactive Demo

Experience the simulation logic in browser.

LAUNCH SIMULATION

Github

Explore the C# scripts and ML configurations.

ACCESS REPOSITORY

Download

Read the complete dissertation paper.

DOWNLOAD PDF
100%
DOWNLOADING DOCUMENT...