Option to specify specific offset when finding dupes #137

Open
negativeExponent opened this issue Jun 30, 2023 · 1 comment

Comments

@negativeExponent

For example, skip or ignore the first 100 bytes of each file. This would be useful for files with fixed header info or similar.
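To make the request concrete, here is a minimal sketch of what an offset-based pre-check could look like. It is not how this project works internally; the function names, the 100-byte default offset, and the 4096-byte read length are all assumptions chosen for illustration.

```python
import hashlib
from collections import defaultdict


def fingerprint_after_offset(path, offset=100, length=4096):
    """Hash `length` bytes starting `offset` bytes into the file.

    Hypothetical helper: `offset` and `length` are illustrative defaults,
    not options of any existing tool.
    """
    with open(path, "rb") as f:
        f.seek(offset)  # skip the fixed header
        return hashlib.sha256(f.read(length)).hexdigest()


def group_by_fingerprint(paths, offset=100, length=4096):
    """Return groups of paths whose post-header fingerprints match."""
    groups = defaultdict(list)
    for p in paths:
        groups[fingerprint_after_offset(p, offset, length)].append(p)
    # Only groups with more than one member are duplicate candidates.
    return [g for g in groups.values() if len(g) > 1]
```

Files in the same group are only candidates; a full checksum would still be needed to confirm them. Note that files shorter than the offset all yield the same (empty) fingerprint, so they would end up in one candidate group.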

@chrisulbrich

I've got a similar challenge.
I have many thousands of raw images (*.ARW) from my Sony camera. In most cases the headers of these files are identical, so the first-bytes scan usually doesn't remove any files from the list. Unfortunately, in most cases the last bytes are identical too.
As a result, the checksums of all files usually have to be calculated, which is very expensive.

It would be very helpful to have options to define how many bytes are read at the start and at the end of each file.
I think reading more bytes would be more versatile than using an offset, in case files of other types (sidecar and project files in my case) are mixed in with the large number of image files.
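As a rough illustration of the idea of configurable first/last byte counts, a pre-check might look something like this. This is only a sketch under my own assumptions: `head_tail_signature`, `head_bytes`, and `tail_bytes` are hypothetical names, not existing options of this project.

```python
import hashlib
import os


def head_tail_signature(path, head_bytes=4096, tail_bytes=4096):
    """Return (size, digest) built from the first and last bytes of a file.

    Hypothetical helper: `head_bytes` and `tail_bytes` stand in for the
    requested options and can be raised until the shared header is passed.
    """
    size = os.path.getsize(path)
    h = hashlib.sha256()
    with open(path, "rb") as f:
        h.update(f.read(head_bytes))         # first N bytes
        f.seek(max(size - tail_bytes, 0))    # clamp for small files
        h.update(f.read(tail_bytes))         # last M bytes
    return size, h.hexdigest()
```

Because nothing is skipped, the same settings still work for unrelated file types in the same folder: their differences simply show up earlier in the bytes that are read. For small files the head and tail ranges may overlap, which is harmless for a pre-check.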
