summaryrefslogtreecommitdiff
path: root/README
AgeCommit message (Collapse)Author
2020-10-25use python3-pil instead of removed python3-imagingHelmut Grohne
2020-02-16drop support for Python 2.xHelmut Grohne
2016-05-23remove curl dependencyHelmut Grohne
Teach importpkg how to download urls using urlopen and thus remove the need for invoking curl.
2013-08-01support hashing gif imagesHelmut Grohne
* Rename "image_sha512" to "png_sha512". * dedup.image.ImageHash is now a base class for image hashes such as PNGHash and GIFHash. * Enable both hashes in importpkg. * Fix README. * Add new hash combinations to webapp. * Add "gif file not named *.gif" to issues in update_sharing. * Add redirect for "image_sha512" to webapp for backwards compatibility.
2013-07-27move templates to dedup packageHelmut Grohne
They cluttered webapp.py and now vim can give proper highlighting for the templates.
2013-07-26Merge branch functionidHelmut Grohne
Actual savings on the full data set are around 7%. Conflicts: README
2013-07-25README: foo.PNG is also a valid png nameHelmut Grohne
2013-07-23README: fix typo in queryHelmut Grohne
2013-07-23adapt queries in README to new schemaHelmut Grohne
2013-07-10schema: reference package table by integer keyHelmut Grohne
One approach to improve performance is to reduce the database size. A package name takes up 15 bytes in average. A number of a package takes up two bytes. Multiply that difference with the number of references and it should be noticeably. A small test set show a reduction by 10%.
2013-07-03README: explain update_sharing.pyHelmut Grohne
2013-06-10split the import phase to a yaml streamHelmut Grohne
importpkg.py now emits a yaml stream instead of updating the database. The acutual updating now happens in readyaml.py. In this process autoimport.py was significantly reworked to import packages in parallel.
2013-04-08README: improve query after schemachangeHelmut Grohne
2013-03-10README: update queries to match content table splitHelmut Grohne
2013-03-07README: explain queriesHelmut Grohne
2013-03-06README: added interesting queryHelmut Grohne
2013-03-02update READMEHelmut Grohne
* Tell about schema.sql. * Explain WAL.
2013-02-25README: another interesting queryHelmut Grohne
2013-02-24hash image contentsHelmut Grohne
2013-02-24README: fix mistakeHelmut Grohne
2013-02-21added READMEHelmut Grohne