res-dedup

Fast file-level duplication scanner for Windows, with FileId awareness.

This tool was mainly designed for (and benchmarked on) SSD. Parallel sequential read streams are impossible for HDD.

Benchmark

OS: Windows 11 24H2 / Revision patched
CPU: i7-12700H
SSD: Acer GM7000
Cache Size: 256 KiB
Concurrency: 128
Benchmark tool: perfmon.exe

Item	Average	Peak	Note
Read op/s	22162.8	25433.4
res-dedup	3839 MiB/s	3951 MiB/s	72.9GiB, 56548 files
res-dedup	5008 MiB/s	5256 MiB/s	55.0GiB, 1563 files
res-dedup	1708 MiB/s	1755 MiB/s	4GiB, single file 16MiB buffer no parallel read
SEQ1M Q8T1 ¹	6851 MiB/s	--	Single 1GiB file
SEQ1M Q8T1 ¹	4432 MiB/s	--	Single 4GiB file

TODO

Linux support (with getdents64)

Usage

See -h / --help.

Output format

JSON Lines of {"source": string, "other": string}

source is the path to first file found with the same hash with other.

Purpose

Reasonbly fast & lightweight file duplication scanner.

By design, hard links are treated as non-duplication.

Non-purpose

Replacement of general & powerful deduplication tools like jdupes

Bahaviour

When you create a hard link on the NTFS file system,

the file attribute information in the directory entry is refreshed only when the file is opened,

or when GetFileInformationByHandle is called with the handle of a specific file. ²

Thus, this project would never care about file attributes, but the file content itself.

Media similarity is also out of scope.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

res-dedup

Benchmark

TODO

Usage

Output format

Purpose

Non-purpose

Bahaviour

About

Uh oh!

Releases 11

Packages

Languages

License

mokurin000/res-dedup

Folders and files

Latest commit

History

Repository files navigation

res-dedup

Benchmark

TODO

Usage

Output format

Purpose

Non-purpose

Bahaviour

Footnotes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Languages

Packages