Duplicates not working v6.73
Posted: Sat Sep 23, 2017 5:42 pm
I'm a long time user of NewsBin. I have been running v6.62 for quite a while and updated to v6.73 to get the filter by poster capability and block downloads in groups.
But I have noticed a problem with duplicates.
I have always downloaded duplicates. I have the "Use Duplicate Detector" unchecked, "Folder Dup Bypass" checked, "Auto Rename" checked, "Copy Style" unchecked.
In the past if I downloaded the same jpg (for example) that was in multiple groups (each group has it's own folder) then I would get a jpg in each folder. Also if I downloaded (bypass filters) the file again in the same group/folder it would be downloaded with suffix 001, 002, etc. and the jpg would show in the thumbnail viewer. If different posts in a group had the same filename/sig then it would be downloaded with the suffix 001, 002, etc as well.
Under v6.73:
if I download the jpg a second time using bypass filters it does not show up in the folder as a 001 and does not display in thumbnail viewer.
It does download one time for each group/folder but again, if I download bypass the file again it does not show up.
I know signature checking is not being stored because my signature db is just a few KB and the date of the file is very old.
As a test I deleted the jpg from the group folder and downloaded it and it did appear in the folder.
I then deleted the jpg, renamed a different jpg to that file name (the file had a different size) and downloaded the file. It did show up in the folder as a 001 file and displayed in the thumbnail viewer. I did this multiple times and every time it was downloaded with the next suffix.
So NewsBin seems to be checking if the file exists already in that folder and if it is the same size (which is not a reliable test for duplicates) or maybe is doing a sig check of the actual file on disk and comparing it to the new downloaded file and if it is the same does not do the duplicate rename... sort of as if the "Use Duplicate Detector" was active. Again, I know my sig file is not being used as it is not growing and has an old date.
For my tests I used the same post over and over. I don't know what would happen if different posts in the same group/folder had the same filename/sig... would the download occur with 001 or not. In the past it would have been downloaded with the suffix 001, 002, etc.
This has to be a bug.
Thanks
-------------
Quade, I just saw this post...
Disable duplicate file detection?
Is it possible to disable the duplicate file detection?
I'm constantly getting false positives.
VERSION : 6.72: 00 A8 C7 15 FE 9C
bevo n00b
Unread postby Quade ยป Wed Aug 23, 2017 10:11 pm
Yes, It's in the advanced options.
Keep in mind if Newsbin detects the same filename on disk and either it's identical or the first whole chunk is identical, it won't make a new file with the same name.
END POST
Quade, Concerning your last sentence I don't remember it working that way if Duplicate Detection is turn off... especially if you use Download Bypass Filters. So is that the problem... Bypass Filters is not working or has been changed in how it works? Actually I don't have a problem if that is the case as long as the thumbnail would be displayed of the existing file. I am wanting to see the thumbnail... I don't care if it downloads again or not. Of course we both know this only works for the first file saved with filename. If the filename is used by another file with a different sig it would have been saved with a suffix so NewsBin would not know that and in those cases the file would always download with a new suffix. Right?
But I have noticed a problem with duplicates.
I have always downloaded duplicates. I have the "Use Duplicate Detector" unchecked, "Folder Dup Bypass" checked, "Auto Rename" checked, "Copy Style" unchecked.
In the past if I downloaded the same jpg (for example) that was in multiple groups (each group has it's own folder) then I would get a jpg in each folder. Also if I downloaded (bypass filters) the file again in the same group/folder it would be downloaded with suffix 001, 002, etc. and the jpg would show in the thumbnail viewer. If different posts in a group had the same filename/sig then it would be downloaded with the suffix 001, 002, etc as well.
Under v6.73:
if I download the jpg a second time using bypass filters it does not show up in the folder as a 001 and does not display in thumbnail viewer.
It does download one time for each group/folder but again, if I download bypass the file again it does not show up.
I know signature checking is not being stored because my signature db is just a few KB and the date of the file is very old.
As a test I deleted the jpg from the group folder and downloaded it and it did appear in the folder.
I then deleted the jpg, renamed a different jpg to that file name (the file had a different size) and downloaded the file. It did show up in the folder as a 001 file and displayed in the thumbnail viewer. I did this multiple times and every time it was downloaded with the next suffix.
So NewsBin seems to be checking if the file exists already in that folder and if it is the same size (which is not a reliable test for duplicates) or maybe is doing a sig check of the actual file on disk and comparing it to the new downloaded file and if it is the same does not do the duplicate rename... sort of as if the "Use Duplicate Detector" was active. Again, I know my sig file is not being used as it is not growing and has an old date.
For my tests I used the same post over and over. I don't know what would happen if different posts in the same group/folder had the same filename/sig... would the download occur with 001 or not. In the past it would have been downloaded with the suffix 001, 002, etc.
This has to be a bug.
Thanks
-------------
Quade, I just saw this post...
Disable duplicate file detection?
Is it possible to disable the duplicate file detection?
I'm constantly getting false positives.
VERSION : 6.72: 00 A8 C7 15 FE 9C
bevo n00b
Unread postby Quade ยป Wed Aug 23, 2017 10:11 pm
Yes, It's in the advanced options.
Keep in mind if Newsbin detects the same filename on disk and either it's identical or the first whole chunk is identical, it won't make a new file with the same name.
END POST
Quade, Concerning your last sentence I don't remember it working that way if Duplicate Detection is turn off... especially if you use Download Bypass Filters. So is that the problem... Bypass Filters is not working or has been changed in how it works? Actually I don't have a problem if that is the case as long as the thumbnail would be displayed of the existing file. I am wanting to see the thumbnail... I don't care if it downloads again or not. Of course we both know this only works for the first file saved with filename. If the filename is used by another file with a different sig it would have been saved with a suffix so NewsBin would not know that and in those cases the file would always download with a new suffix. Right?