Age | Commit message | Author |
|
Teach importpkg to download URLs using urlopen, which removes the need
to invoke curl.
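
A minimal Python 3 sketch of the idea, not the actual importpkg code: the
helper name fetch and the timeout default are assumptions made for
illustration. It copies the response into a temporary file so the caller
gets a seekable file object without spawning curl.

```python
import shutil
import tempfile
from urllib.request import urlopen


def fetch(url, timeout=30):
    """Download url into a temporary file and return it, rewound to the start.

    Hypothetical helper; the name and the timeout default are assumptions.
    """
    tmp = tempfile.TemporaryFile()
    # urlopen returns a file-like response that also works as a context manager.
    with urlopen(url, timeout=timeout) as response:
        shutil.copyfileobj(response, tmp)
    tmp.seek(0)
    return tmp
```

Since urlopen also understands file: and data: URLs, the same code path can
be exercised without network access.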
|
* Rename "image_sha512" to "png_sha512".
* dedup.image.ImageHash is now a base class for image hashes such as
PNGHash and GIFHash.
* Enable both hashes in importpkg.
* Fix README.
* Add new hash combinations to webapp.
* Add "gif file not named *.gif" to issues in update_sharing.
* Add redirect for "image_sha512" to webapp for backwards
compatibility.
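
The base class arrangement described above could look roughly like the
following sketch. It is not the code from dedup.image: the attribute names,
the signature check and the name "gif_sha512" (guessed by analogy with
"png_sha512") are assumptions, and hashing the raw file bytes is only a
stand-in that keeps the example self-contained.

```python
import hashlib


class ImageHash:
    """Shared logic: buffer the input, check the file signature, hash it."""
    name = None        # hash name, set by subclasses
    signatures = ()    # accepted file signatures, set by subclasses

    def __init__(self):
        self.content = b""

    def update(self, data):
        self.content += data

    def hexdigest(self):
        # bytes.startswith accepts a tuple of candidate prefixes.
        if not self.content.startswith(self.signatures):
            raise ValueError("input is not valid for %s" % self.name)
        # Assumption: hash the raw bytes; a real implementation may well
        # hash decoded image data instead.
        return hashlib.sha512(self.content).hexdigest()


class PNGHash(ImageHash):
    name = "png_sha512"
    signatures = (b"\x89PNG\r\n\x1a\n",)


class GIFHash(ImageHash):
    name = "gif_sha512"  # assumed name, not taken from the log
    signatures = (b"GIF87a", b"GIF89a")
```

Keeping the format-specific parts in class attributes means a new image
format only needs a subclass with its own name and signatures.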
|
They cluttered webapp.py and now vim can give proper highlighting for
the templates.
|
Actual savings on the full data set are around 7%.
Conflicts:
README
|
One approach to improving performance is to reduce the database size. A
package name takes up 15 bytes on average, while a package number takes
up two bytes. Multiply that difference by the number of references and
the saving should be noticeable. A small test set shows a reduction of 10%.
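
The normalization can be sketched with sqlite3 as follows. The table and
column names (package, content, pid) and both helper functions are
assumptions for illustration and need not match the project's schema.sql.

```python
import sqlite3

# Hypothetical schema: names are stored once, references use the integer id.
SCHEMA = """
CREATE TABLE package (id INTEGER PRIMARY KEY, name TEXT UNIQUE NOT NULL);
CREATE TABLE content (pid INTEGER NOT NULL REFERENCES package(id),
                      filename TEXT NOT NULL);
"""


def package_id(db, name):
    """Return the id for a package name, creating the row if needed."""
    db.execute("INSERT OR IGNORE INTO package (name) VALUES (?)", (name,))
    row = db.execute("SELECT id FROM package WHERE name = ?", (name,)).fetchone()
    return row[0]


def add_file(db, package, filename):
    """Record a file, referencing its package by number instead of by name."""
    db.execute("INSERT INTO content (pid, filename) VALUES (?, ?)",
               (package_id(db, package), filename))
```

Every row in content then carries a small integer rather than a repeated
name, which is where the size reduction comes from.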
|
importpkg.py now emits a yaml stream instead of updating the database.
The actual updating now happens in readyaml.py. In the process,
autoimport.py was significantly reworked to import packages in parallel.
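
The split into a serializing producer and a single database writer might be
sketched as below. This is not the project's code: extract, load and
import_all are hypothetical names, the table layout is invented, and JSON
from the standard library stands in for the YAML stream so that the example
has no third-party dependency.

```python
import json
import sqlite3
from concurrent.futures import ThreadPoolExecutor


def extract(package):
    """Stand-in for the importpkg step: serialize one package's data."""
    name, files = package
    return json.dumps({"package": name, "files": files})


def load(db, document):
    """Stand-in for the readyaml step: apply one document to the database."""
    record = json.loads(document)
    db.executemany("INSERT INTO content (package, filename) VALUES (?, ?)",
                   [(record["package"], f) for f in record["files"]])


def import_all(db, packages, workers=4):
    """Extract packages in parallel; write to the database from one thread."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map runs extract concurrently and yields results in input order.
        for document in pool.map(extract, packages):
            load(db, document)
    db.commit()


db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE content (package TEXT, filename TEXT)")
import_all(db, [("a", ["x", "y"]), ("b", ["z"])])
```

Because the extraction step never touches the database, it can run
concurrently while all writes stay serialized in one place.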
|
* Tell about schema.sql.
* Explain WAL.
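
For context, SQLite's write-ahead log (WAL) is switched on per database
with a pragma. The snippet below is a small sketch using Python's sqlite3
module; the helper name open_wal and the file name are assumptions, and the
README text itself is not reproduced here.

```python
import os
import sqlite3
import tempfile


def open_wal(path):
    """Open a database file and ask SQLite to use write-ahead logging."""
    db = sqlite3.connect(path)
    # The pragma returns the journal mode that is actually in effect.
    mode = db.execute("PRAGMA journal_mode=WAL").fetchone()[0]
    return db, mode


# WAL needs a database file on disk; an in-memory database cannot use it.
with tempfile.TemporaryDirectory() as tmpdir:
    db, mode = open_wal(os.path.join(tmpdir, "test.sqlite3"))
    print(mode)
    db.close()
```

The setting is persistent: once a database file is in WAL mode, later
connections use it as well, and readers no longer block the writer.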
|