summaryrefslogtreecommitdiff
path: root/importpkg.py
AgeCommit message (Collapse)Author
2015-04-16use binary stdin on py3kHelmut Grohne
2015-04-16distinguish bytes from unicode for py3kHelmut Grohne
2014-07-23importpkg: be more liberal in control file namingHelmut Grohne
While in current sid packages the control file in control.tar is always named "./control", some older packages name it "control".
2014-05-11importpkg: reduce copy&pasteHelmut Grohne
2014-05-11importpkg: add support for data.tar.lzmaGuillem Jover
Creating packages with lzma compression has been deprecated since dpkg 1.16.4, but there might be some of those in the wild and supporting them is strightforward when xz is already supported. Signed-off-by: Guillem Jover <guillem@debian.org>
2014-05-11importpkg: add support for control.tar and control.tar.xzGuillem Jover
dpkg supports those since 1.17.6. Signed-off-by: Guillem Jover <guillem@debian.org>
2014-02-23spell check commentsHelmut Grohne
2014-02-19blacklist content rather than hashesHelmut Grohne
Otherwise the gzip hash cannot tell the empty stream and the compressed empty stream apart.
2013-09-02importpkg: move library-like parts to dedup.debpkgHelmut Grohne
2013-08-19importpkg: don't blacklist boring gzip_sha512 hashesHelmut Grohne
* In practise there are very few compressed files with trivial hashes. * Blacklisting these values results in false positives in the gzip issues.
2013-08-01support hashing gif imagesHelmut Grohne
* Rename "image_sha512" to "png_sha512". * dedup.image.ImageHash is now a base class for image hashes such as PNGHash and GIFHash. * Enable both hashes in importpkg. * Fix README. * Add new hash combinations to webapp. * Add "gif file not named *.gif" to issues in update_sharing. * Add redirect for "image_sha512" to webapp for backwards compatibility.
2013-07-29importpkg.py: support uncompressed data.tarHelmut Grohne
2013-07-26verify package hashes when importing via httpHelmut Grohne
2013-07-12importpkg: simplify state logicHelmut Grohne
2013-07-12importpkg: split process_package to process_controlHelmut Grohne
2013-06-10split the import phase to a yaml streamHelmut Grohne
importpkg.py now emits a yaml stream instead of updating the database. The acutual updating now happens in readyaml.py. In this process autoimport.py was significantly reworked to import packages in parallel.
2013-03-26Merge branch schemachangeHelmut Grohne
2013-03-12move ArReader from importpkg to dedup.arreaderHelmut Grohne
Also document it.
2013-03-09split content table to a hash tableHelmut Grohne
In the old content table (package, filename, size) would be the same for multiple hash functions. Now the schema represents that each file has precisely one size, but multiple hashes.
2013-03-07enable enforcing foreign keysHelmut Grohne
2013-03-07integrate the source table into the package tableHelmut Grohne
2013-03-05importpkg: source header may contain a versionHelmut Grohne
2013-03-04importpkg: record the source package relationshipHelmut Grohne
2013-03-02move sql schema to a separate fileHelmut Grohne
2013-02-24hash image contentsHelmut Grohne
2013-02-23importpkg: ignore filenames with encoding errorsHelmut Grohne
2013-02-21move compression functions to module dedup.compressionHelmut Grohne
2013-02-21move hashing functions to module dedup.hashingHelmut Grohne
2013-02-21rename test.py to importpkg.pyHelmut Grohne