Delete duplicated entries
@gpoo
Submitted by Germán Poo-Caamaño Link to original bug (#705252)
Description
Way to "delete duplicated entries" in a database?
When I am merging the different databases that I have, sometimes I findmyself re-merging database B twice to A, so that I have duplicated entries for all B in the merged database.
Is there a way to eliminate the duplicated (and identic, if you haven'tedited them) entries right now?
Gobry
This is possibly something that does not need to be deeply rooted in the core api, so you can start by writing some logic code that can provide potential matches (ie items that might be identical, given some metric). This can basically be a fresh module in Pyblio/. Keep in mind this is sth that will have heavy interactions with a GUI. Then, in Pyblio/GnomeUI, you'll probably need to create a new dialog, able to propose your matches in a convenient way. Here too, this can be a fresh module, possibly reusing GnomeUI/Entry.py to display the entries to compare.
John
And if you're really enthusiastic, Peter pointed out some literature onduplicate identification in his design docs -- they're on the websitesomewhere. Of course, a duplicate feature that exists is a lot moreuseful than one that doesn't, so if you don't have time to read that kindof stuff, go ahead and implement something that works anyway! Even ifsomebody improves on your duplicate-finding algorithm later, any GUI codeyou write will likely get re-used.
https://sourceforge.net/p/pybliographer/feature-requests/11/