+
Skip to content

mokurin000/res-dedup

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

res-dedup

Fast file-level duplication scanner for Windows, with FileId awareness.

This tool was mainly designed for (and benchmarked on) SSD. Parallel sequential read streams are impossible for HDD.

Benchmark

  • OS: Windows 11 24H2 / Revision patched
  • CPU: i7-12700H
  • SSD: Acer GM7000
  • Cache Size: 256 KiB
  • Concurrency: 128
  • Benchmark tool: perfmon.exe
Item Average Peak Note
Read op/s 22162.8 25433.4
res-dedup 3839 MiB/s 3951 MiB/s 72.9GiB, 56548 files
res-dedup 5008 MiB/s 5256 MiB/s 55.0GiB, 1563 files
res-dedup 1708 MiB/s 1755 MiB/s 4GiB, single file
16MiB buffer
no parallel read
SEQ1M Q8T1 1 6851 MiB/s -- Single 1GiB file
SEQ1M Q8T1 1 4432 MiB/s -- Single 4GiB file

TODO

  • Linux support (with getdents64)

Usage

See -h / --help.

Output format

JSON Lines of {"source": string, "other": string}

source is the path to first file found with the same hash with other.

Purpose

Reasonbly fast & lightweight file duplication scanner.

By design, hard links are treated as non-duplication.

Non-purpose

Replacement of general & powerful deduplication tools like jdupes

Bahaviour

When you create a hard link on the NTFS file system,

the file attribute information in the directory entry is refreshed only when the file is opened,

or when GetFileInformationByHandle is called with the handle of a specific file. 2

Thus, this project would never care about file attributes, but the file content itself.

Media similarity is also out of scope.

Footnotes

  1. https://sourceforge.net/projects/crystaldiskmark/files/9.0.1/CrystalDiskMark9_0_1Aoi.exe/download 2

  2. https://learn.microsoft.com/en-us/windows/win32/api/WinBase/nf-winbase-createhardlinka#remarks

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载