Official PyTorch implementation of our paper at the ICCV 2025 1st Multimodal Sign Language Recognition (MSLR) Workshop
This repository presents a point-supervised temporal localization pipeline for Japanese fingerspelling. We enhance HR-Pro with three key components:
- A transformer-based encoder (VideoMAE v2)
- SimCLR-style point-supervised contrastive learning (Point-Sup. CL; a loss sketch follows below)
- Joint angle features derived from MediaPipe Hands (an angle-feature sketch follows below)
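As a rough illustration of the Point-Sup. CL objective, the sketch below implements a SimCLR-style (NT-Xent) loss in which clip embeddings sharing the same point-annotation label are treated as positives. The function name, positive-sampling scheme, and temperature are illustrative assumptions rather than the paper's exact formulation; in practice the embeddings would come from the VideoMAE v2 encoder (typically followed by a projection head).

```python
# Sketch of a point-supervised, SimCLR-style (NT-Xent) objective.
# Assumption: each row of `embeddings` is a clip embedding centred on a
# point annotation, and clips sharing a label are treated as positives.
# The paper's exact sampling, projection head, and temperature may differ.
import torch
import torch.nn.functional as F

def point_supervised_nt_xent(embeddings: torch.Tensor,
                             labels: torch.Tensor,
                             temperature: float = 0.1) -> torch.Tensor:
    """embeddings: (N, D) clip embeddings; labels: (N,) point-annotation classes."""
    z = F.normalize(embeddings, dim=1)
    sim = z @ z.t() / temperature                                   # (N, N) similarities
    self_mask = torch.eye(len(z), dtype=torch.bool, device=z.device)
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask

    # Softmax denominator runs over all other samples (anchor excluded).
    sim = sim.masked_fill(self_mask, float("-inf"))
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    log_prob = log_prob.masked_fill(self_mask, 0.0)                 # avoid -inf * 0

    # Mean negative log-probability of the positives for each anchor.
    pos_counts = pos_mask.sum(dim=1)
    valid = pos_counts > 0                                          # anchors with >= 1 positive
    loss = -(log_prob * pos_mask.float()).sum(dim=1)
    return (loss[valid] / pos_counts[valid]).mean()
```

The angular features can be illustrated in a similar spirit. The sketch below assumes joint angles are measured at the finger joints of MediaPipe's 21-landmark hand model; the exact angle set and normalization used in the paper may differ, and `joint_angles` is a hypothetical helper name.

```python
# Illustrative sketch only: per-frame joint angles from MediaPipe Hands'
# 21-landmark output (0 = wrist; 1-4 thumb; 5-8 index; 9-12 middle;
# 13-16 ring; 17-20 pinky). The paper's exact angle definition may differ.
import numpy as np

# (parent, joint, child) triplets: the angle is measured at `joint`.
_ANGLE_TRIPLETS = [
    (0, 1, 2), (1, 2, 3), (2, 3, 4),          # thumb
    (0, 5, 6), (5, 6, 7), (6, 7, 8),          # index
    (0, 9, 10), (9, 10, 11), (10, 11, 12),    # middle
    (0, 13, 14), (13, 14, 15), (14, 15, 16),  # ring
    (0, 17, 18), (17, 18, 19), (18, 19, 20),  # pinky
]

def joint_angles(landmarks: np.ndarray) -> np.ndarray:
    """landmarks: (21, 3) x/y/z hand landmarks for one frame -> (15,) angles in radians."""
    angles = []
    for parent, joint, child in _ANGLE_TRIPLETS:
        v1 = landmarks[parent] - landmarks[joint]
        v2 = landmarks[child] - landmarks[joint]
        cos = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2) + 1e-8)
        angles.append(np.arccos(np.clip(cos, -1.0, 1.0)))
    return np.asarray(angles, dtype=np.float32)
```

Stacking such per-frame vectors over time yields an angular feature stream that can be combined with the video features.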
```
.
├── feature_extraction   # Feature extraction pipelines (Angular / I3D)
├── hrpro                # HR-Pro-based temporal localization with point annotations
└── videomae             # Point-supervised contrastive learning with VideoMAE v2
```
Each directory includes its own README.md
with detailed instructions for setup and execution.
Prerequisites:
- uv (or your preferred Python package manager)
- CUDA >= 12.4
- OpenCV
We have released ub-MOJI, a Japanese fingerspelling video dataset with point-level annotations, on Hugging Face.
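A minimal download sketch using `huggingface_hub` is shown below; `"<org>/ub-MOJI"` is a placeholder, so substitute the actual repository id listed on the dataset's Hugging Face page.

```python
# Hypothetical download sketch; "<org>/ub-MOJI" is a placeholder for the
# actual Hugging Face dataset repository id listed on the dataset page.
from huggingface_hub import snapshot_download

local_path = snapshot_download(repo_id="<org>/ub-MOJI", repo_type="dataset")
print(local_path)  # directory containing the downloaded videos and annotations
```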
Localization performance (mean Average Precision across tIoU thresholds) on the ub-MOJI dataset:
Model | mAP@0.1–0.5 | mAP@0.3–0.7 | mAP@0.1–0.7 |
---|---|---|---|
I3D (RGB + Flow) | 57.6% | 50.8% | 52.9% |
I3D + Angular | 90.8% | 80.4% | 84.0% |
VideoMAE v2 | 62.9% | 56.5% | 58.6% |
VideoMAE v2 + Point-Sup. CL | 93.4% | 78.9% | 83.6% |
VideoMAE v2 + Point-Sup. CL + Angular | 90.9% | 79.6% | 83.7% |
Download our trained checkpoints from Hugging Face.
For questions or collaborations, feel free to open an Issue or Pull Request.
This code is released under the MIT License.
Please refer to the dataset repository for dataset-specific licensing terms.
Contributors:
- Ryota Murai (Code Owner)
- Naoto Tsuta
- Duk Shin
- Yousun Kang
TBD