path: root/README
Age | Commit message | Author
2013-08-02 | Merge branch master into sqlalchemy | Helmut Grohne
This makes the sqlalchemy branch schema-compatible with master again. The biggest change on master was the introduction of the function table, which caused most of the conflicts. Note that webapp had one conflict not detected by git: the selection of issues in show_package needed conversion to sqlalchemy.
Conflicts: README, update_sharing.py, webapp.py
2013-08-01 | support hashing gif images | Helmut Grohne
* Rename "image_sha512" to "png_sha512".
* dedup.image.ImageHash is now a base class for image hashes such as PNGHash and GIFHash.
* Enable both hashes in importpkg.
* Fix README.
* Add new hash combinations to webapp.
* Add "gif file not named *.gif" to issues in update_sharing.
* Add redirect for "image_sha512" to webapp for backwards compatibility.
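The base-class arrangement described in this commit can be illustrated roughly as follows. This is a hypothetical sketch, not the code in dedup.image: it only checks the file signature and hashes the raw bytes, whereas the project hashes image contents. The name "gif_sha512" is assumed by analogy with "png_sha512", and the method names are illustrative.

```python
import hashlib


class ImageHash:
    """Hypothetical base class: subclasses supply a signature and a name."""

    magic = b""            # leading bytes that identify the image format
    name = "image_sha512"  # label under which the digest would be stored

    def __init__(self):
        self._content = b""

    def update(self, data):
        # Collect the file contents piece by piece.
        self._content += data

    def hexdigest(self):
        # Refuse files whose signature does not match the expected format.
        if not self._content.startswith(self.magic):
            raise ValueError("signature mismatch for " + self.name)
        # Simplification: hash the raw bytes instead of decoded pixels.
        return hashlib.sha512(self._content).hexdigest()


class PNGHash(ImageHash):
    magic = b"\x89PNG\r\n\x1a\n"
    name = "png_sha512"


class GIFHash(ImageHash):
    magic = b"GIF8"        # prefix shared by GIF87a and GIF89a
    name = "gif_sha512"    # assumed name, by analogy with png_sha512
```

With this layout, supporting another format would only need a further subclass with its own signature and name.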
2013-07-27 | move templates to dedup package | Helmut Grohne
The templates cluttered webapp.py, and now vim can highlight them properly.
2013-07-26 | Merge branch functionid | Helmut Grohne
Actual savings on the full data set are around 7%.
Conflicts: README
2013-07-25 | README: foo.PNG is also a valid png name | Helmut Grohne
2013-07-23 | README: fix typo in query | Helmut Grohne
2013-07-23 | adapt queries in README to new schema | Helmut Grohne
2013-07-17 | Merge branch master into sqlalchemy | Helmut Grohne
This basically pulls the packageid branch into sqlalchemy. The merge was complex because many SQL statements had diverged. It brings us one step closer to supporting postgres, because an "INSERT OR REPLACE" was removed from readyaml.py in the packageid branch.
Conflicts: update_sharing.py, webapp.py
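"INSERT OR REPLACE" is SQLite-specific syntax, which is why removing it helps portability. One portable alternative is to try an UPDATE and fall back to an INSERT when no row matched. The sketch below shows that pattern only; how readyaml.py actually replaced the statement is not stated here, and the table layout (`package` with `name` and `version` columns) and the function name are assumptions made for illustration.

```python
import sqlite3


def upsert_package(conn, name, version):
    """Update the row for `name`, inserting it if it does not exist yet."""
    cur = conn.execute(
        "UPDATE package SET version = ? WHERE name = ?", (version, name))
    if cur.rowcount == 0:
        # No existing row matched, so create one.
        conn.execute(
            "INSERT INTO package (name, version) VALUES (?, ?)",
            (name, version))


# Hypothetical schema, for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE package (name TEXT PRIMARY KEY, version TEXT)")
upsert_package(conn, "foo", "1.0")
upsert_package(conn, "foo", "1.1")
rows = conn.execute("SELECT name, version FROM package").fetchall()
```

Unlike "INSERT OR REPLACE", this keeps the existing row instead of deleting and re-creating it. It also assumes a single writer, since another process could insert the row between the two statements.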
2013-07-10 | schema: reference package table by integer key | Helmut Grohne
One approach to improving performance is to reduce the database size. A package name takes up 15 bytes on average, whereas a package number takes up two bytes. Multiplied by the number of references, that difference should be noticeable. A small test set shows a reduction of 10%.
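The size argument above is simple arithmetic and can be written out as a rough estimate. The per-value sizes come from the commit message; the reference count of one million is a made-up figure for illustration, and the estimate ignores per-row and index overhead in the database.

```python
def estimated_saving(references, name_bytes=15, key_bytes=2):
    """Bytes saved by storing an integer key instead of a package name."""
    return references * (name_bytes - key_bytes)


# Hypothetical reference count; the real number is not given here.
saving = estimated_saving(1_000_000)  # 13_000_000 bytes, about 13 MB
```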
2013-07-03 | README: explain update_sharing.py | Helmut Grohne
2013-06-23 | Merge master into sqlalchemy | Helmut Grohne
This is necessary to avoid severe merge conflicts when converting importpkg.py to sqlalchemy. The actual sql invocation has moved to a different file in master.
Conflicts: README (diverged set of dependencies)
2013-06-13 | webapp: use sqlalchemy | Helmut Grohne
* Arguably the interface is nicer.
* Actually closes connections. => wal files get deleted.
* Permits switching from sqlite to anything.
2013-06-10 | split the import phase to a yaml stream | Helmut Grohne
importpkg.py now emits a yaml stream instead of updating the database. The actual updating now happens in readyaml.py. In this process, autoimport.py was significantly reworked to import packages in parallel.
2013-04-08 | README: improve query after schemachange | Helmut Grohne
2013-03-10 | README: update queries to match content table split | Helmut Grohne
2013-03-07 | README: explain queries | Helmut Grohne
2013-03-06 | README: added interesting query | Helmut Grohne
2013-03-02 | update README | Helmut Grohne
* Tell about schema.sql.
* Explain WAL.
2013-02-25 | README: another interesting query | Helmut Grohne
2013-02-24 | hash image contents | Helmut Grohne
2013-02-24 | README: fix mistake | Helmut Grohne
2013-02-21 | added README | Helmut Grohne