TransUNet Based Autonomous Vehicle Scene Segmentation

Overview

This project implements automatic scene segmentation using three deep learning architectures: UNET CNN, Transformer-UNET (TransUNET), and Swin Transformer UNET. It aims to provide accurate pixel-wise segmentation for applications such as medical imaging, autonomous vehicles, and urban planning. We compare the three models to test the hypothesis that Transformer-based architectures outperform pure convolutional neural networks on this task.

Key Features

  • Implements traditional UNET for pixel-wise segmentation.
  • Integrates Transformer-UNET to enhance global feature learning.
  • Leverages Swin Transformer UNET for hierarchical and efficient segmentation.
  • Supports training and inference on custom datasets.
  • GPU acceleration for faster computations.
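
As background, pixel-wise segmentation assigns a class label to every pixel. A minimal sketch (using NumPy and made-up shapes, not the project's actual models) of turning per-class scores into a label map:

```python
import numpy as np

# Hypothetical per-pixel class scores of shape (num_classes, height, width).
# The real models output a torch tensor with the same layout; NumPy is used
# here only to keep the sketch self-contained.
num_classes, height, width = 3, 4, 4
rng = np.random.default_rng(0)
logits = rng.standard_normal((num_classes, height, width))

# The predicted segmentation map: one class index per pixel.
label_map = logits.argmax(axis=0)

print(label_map.shape)  # (4, 4)
```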

File Structure

├── configs/
│   ├── swin_tiny_patch4_window7_224_lite.yaml
│   └── config.py
├── networks/
│   ├── swin_transformer_unet_skip_expand_decoder_sys.py
│   ├── UNet.py
│   ├── UNetparts.py
│   ├── vision_transformer.py
│   ├── vit_segs_config.py
│   ├── vit_segs_modelling.py
│   └── vit_segs_modelling_resnet_skip.py
├── preprocessing/
│   ├── AugmentedKITTIDataset_unet.py   # Data preprocessing for UNET
│   └── AugmentedKITTIDataset.py        # Data preprocessing for TransUNET and Swin-UNET
├── train/
│   ├── train_UNET_Cnn.py               # UNET CNN training script
│   ├── train_Transunet.py              # TransUNET training script
│   └── train_swinUNET.py               # SwinUNET training script
├── utils.py
├── train_parallel.sh                   # Bash script for training
├── inference.py                        # Inference script
├── requirements.txt                    # Dependency list
└── README.md                           # Project documentation

Running the scripts

Requirements - Libraries

python==3.12.0
torch>=1.10.0
albumentations>=1.0.0
albumentations[imgaug]
numpy>=1.21.0
opencv-python>=4.5.0
matplotlib>=3.4.0
tqdm>=4.60.0
kagglehub

Dataset

import kagglehub

path = kagglehub.dataset_download("klemenko/kitti-dataset")
print("Path to dataset files:", path)

Running this snippet prints the local path where the KITTI dataset was downloaded.
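
Once downloaded, the path can be wired into the preprocessing datasets. The subdirectory layout below is an assumption about the Kaggle archive; the actual folder names and the dataset-class constructors should be checked against preprocessing/:

```python
import os

# Placeholder: `path` would come from kagglehub.dataset_download(...) above.
path = "/root/.cache/kagglehub/datasets/klemenko/kitti-dataset/versions/1"

# Hypothetical KITTI layout; verify against the downloaded archive before use.
images_dir = os.path.join(path, "data_object_image_2", "training", "image_2")
labels_dir = os.path.join(path, "data_object_label_2", "training", "label_2")
print(images_dir)
```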

Command Line Instructions to run

Install Conda/Miniconda, then follow the steps below.

1. Activate Conda Environment

conda activate YOUR_ENV

2. Install all the dependencies

pip install -r requirements.txt

3. Run the training bash script

chmod +x train_parallel.sh
./train_parallel.sh

4. Monitor the training logs

tail -f logs/unet/training.log
tail -f logs/transunet/training.log
tail -f logs/swinunet/training.log

5. To train models individually (if required)

cd train
python train_UNET_Cnn.py   # UNET CNN
python train_Transunet.py  # Transformer UNET
python train_swinUNET.py   # Swin-Transformer UNET
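
All three training scripts minimize a per-pixel classification loss. As an illustration only (the actual scripts use PyTorch, and their loss choice should be checked in train/), here is a self-contained NumPy sketch of mean per-pixel cross-entropy:

```python
import numpy as np

def pixel_cross_entropy(logits, labels):
    """Mean per-pixel cross-entropy.

    logits: (num_classes, H, W) raw scores.
    labels: (H, W) integer class ids.
    """
    # Numerically stable log-softmax over the class axis.
    shifted = logits - logits.max(axis=0, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=0, keepdims=True))
    # Pick the log-probability of the true class at each pixel.
    h, w = labels.shape
    picked = log_probs[labels, np.arange(h)[:, None], np.arange(w)[None, :]]
    return -picked.mean()

rng = np.random.default_rng(1)
logits = rng.standard_normal((3, 2, 2))
labels = rng.integers(0, 3, size=(2, 2))
loss = pixel_cross_entropy(logits, labels)
print(loss)
```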

RESULTS

TESTING

1. Prediction 1 (Test 1)
2. Prediction 2 (Test 2)
3. Prediction 3 (Test 3)

About

This project utilizes a novel application of Swin Transformer with UNet in the field of autonomous driving scene segmentation.
