Final Year Project · 1st Class Honours

A#X0$M9K@1 Navigation

Deep Reinforcement Learning in Complex Traffic Environments

PPO Algorithm

Unity Simulation

C# Implementation

System Overview

An intelligent autonomous vehicle system designed to navigate complex, dynamic traffic environments. Unlike traditional rule-based systems, this project utilises Deep Reinforcement Learning to enable the agent to learn optimal driving policies through trial and error.

The Environment

Unity Simulation Engine

A custom-built, realistic 3D environment in Unity. It simulates:

Dynamic Traffic Lights
Complex Intersections
Multi-Vehicle Interaction
Sensor Ray-casting

The Brain

Proximal Policy Optimisation (PPO)

The agent utilises PPO, a state-of-the-art policy gradient method, to master continuous control. It processes discrete inputs (sensors, speed, traffic state) to output continuous steering and acceleration actions.

Reward Logic

A modular reward architecture guides the learning process. The agent balances multiple objectives:

+ Velocity Maintenance

+ Lane Adherence

! Collision (Terminal)

! Red Light Violation

... Additional rewards and penalties

Interactive Demo

Experience the simulation logic in browser.

LAUNCH SIMULATION

Github

Explore the C# scripts and ML configurations.

ACCESS REPOSITORY

Download

Read the complete dissertation paper.

DOWNLOAD PDF

100%

DOWNLOADING DOCUMENT...