Research Dossier / 02

Human Body Parts Detection using Improved YOLOv5 with Multi-Layer Attention Network (MLA-NET)

Designed an enhanced YOLOv5-based detection framework integrated with a Multi-Layer Attention Network (MLA-NET) for accurate human body part detection in disaster rescue scenarios.

Short LabelMLA-NET YOLOv5

Institution: Research under Dr. Soumajit Pramanik, IIT Bhilai
Role: Computer Vision Researcher
Active Period: 2023 - Present

YOLOv5

PyTorch

OpenPose

Real-ESRGAN

Deep Learning

Method + iterations

Research steps, prototypes, and refinements along the way.

Developed a multi
layer attention module combining channel attention and pixel attention to improve feature refinement.
Automated ground
truth generation using OpenPose — based body joint estimation.
Enhanced low
resolution disaster footage using Real — ESRGAN super — resolution models.
Implemented frame extraction pipelines for video
to — image dataset creation.
Built augmentation sets targeting smoke/noise/blur artifacts common in rescue operations.
Achieved improved robustness in detecting partially occluded or irregularly posed victims.

Visual Notes

ATTENTION ARCHITECTURE

Channel + Pixel attention integration improving detection accuracy in complex rescue environments.

TRAINING PIPELINE

Dataset generation workflow from raw video streams to curated frame-level labels and model-ready samples.

INFERENCE UNDER STRESS

Validation runs across low-light, occluded, and compressed inputs to stress-test deployment resilience.

← Back to archive Next / Termx

Human Body Parts Detection using Improved YOLOv5 with Multi-Layer Attention Network (MLA-NET)

layer attention module combining channel attention and pixel attention to improve feature refinement.

truth generation using OpenPose — based body joint estimation.

resolution disaster footage using Real — ESRGAN super — resolution models.

to — image dataset creation.

Visual Notes

ATTENTION ARCHITECTURE

TRAINING PIPELINE

INFERENCE UNDER STRESS