Age | Commit message (Collapse) | Author | |
---|---|---|---|
2016-04-28 | decouple a function decompress out of decompress_tar | Helmut Grohne | |
Building on the previous commit, add a decompress function that turns a compressed filelike into a decompressed filelike. Use it to decouple the decompression step. | |||
2016-04-28 | extend functionality of DecompressedStream | Helmut Grohne | |
It now supports: * tell() * seek(absolute_position), forward only * close() * closed This is sufficient for putting it as a fileobj into tarfile.TarFile. By doing so we can decouple decompression from tar processing, which eases papering over the Python 2.x vs Python 3.x differences. | |||
2016-04-19 | add a class DebExtractor for guiding feature extraction | Helmut Grohne | |
It is supposed to separate the parsing of Debian packages (understanding how the format works) from the actual feature extraction. Its goal is to simplify writing custom extractors for different feature sets. | |||
2016-04-16 | add a validate method to HashedStream | Helmut Grohne | |
2015-04-16 | process_control: do not encode to ascii | Helmut Grohne | |
Otherwise the yaml will contain binary strings on py3k which end up as binary data in the sqlite database. In py2, yaml can handle those unicode objects just fine. | |||
2015-04-16 | element access on bytes yields int in py3k | Helmut Grohne | |
2015-04-16 | zlib.crc32 behaves inconsistently on py2 vs py3 | Helmut Grohne | |
zlib.crc32 returns a int32_t on py2 and a uint32_t on py3. | |||
2015-04-16 | there is no itertools.imap in py3k | Helmut Grohne | |
2015-04-16 | distinguish bytes from unicode for py3k | Helmut Grohne | |
2014-05-11 | importpkg: add support for control.tar and control.tar.xz | Guillem Jover | |
dpkg supports those since 1.17.6. Signed-off-by: Guillem Jover <guillem@debian.org> | |||
2014-05-11 | dedup.arreader: remove trailing slash from ar members | Guillem Jover | |
The GNU ar format adds a trailing slash to the member names, normalize the member names to take this into account. Signed-off-by: Guillem Jover <guillem@debian.org> | |||
2014-02-23 | spell check comments | Helmut Grohne | |
2014-02-23 | fix spelling mistake | Helmut Grohne | |
Reported-By: Stefan Kaltenbrunner | |||
2014-02-19 | blacklist content rather than hashes | Helmut Grohne | |
Otherwise the gzip hash cannot tell the empty stream and the compressed empty stream apart. | |||
2014-02-19 | GzipDecompressor: don't treat checksum as garbage trailer | Helmut Grohne | |
2014-02-19 | DecompressedHash should fail on trailing input | Helmut Grohne | |
Otherwise all files smaller than 10 bytes are successfully hashed to the hash of the empty input when using the GzipDecompressor. Reported-By: Olly Betts | |||
2013-10-03 | work around python-debian's #670679 | Helmut Grohne | |
2013-09-04 | webapp: serve static files from /static | Helmut Grohne | |
2013-09-02 | importpkg: move library-like parts to dedup.debpkg | Helmut Grohne | |
2013-08-16 | make debian version_compare available in sql | Helmut Grohne | |
2013-08-16 | webapp templates: add an anchor for file issues | Helmut Grohne | |
2013-08-01 | support hashing gif images | Helmut Grohne | |
* Rename "image_sha512" to "png_sha512". * dedup.image.ImageHash is now a base class for image hashes such as PNGHash and GIFHash. * Enable both hashes in importpkg. * Fix README. * Add new hash combinations to webapp. * Add "gif file not named *.gif" to issues in update_sharing. * Add redirect for "image_sha512" to webapp for backwards compatibility. | |||
2013-07-30 | templates/binary: space between package and compare | Helmut Grohne | |
2013-07-30 | templates: wiki.d.o redirects to https now | Helmut Grohne | |
2013-07-27 | also move the static directory into the dedup package | Helmut Grohne | |
2013-07-27 | move templates to dedup package | Helmut Grohne | |
They cluttered webapp.py and now vim can give proper highlighting for the templates. | |||
2013-07-26 | verify package hashes when importing via http | Helmut Grohne | |
2013-05-27 | dedup.image: img.convert can also raise that crazy stuff | Helmut Grohne | |
2013-03-18 | dedup.image: mask errors from PIL | Helmut Grohne | |
2013-03-12 | dedup.arreader: missing bytes marker | Helmut Grohne | |
2013-03-12 | move ArReader from importpkg to dedup.arreader | Helmut Grohne | |
Also document it. | |||
2013-03-02 | move fetchiter from webapp to dedup.utils | Helmut Grohne | |
2013-02-24 | hash image contents | Helmut Grohne | |
2013-02-21 | move compression functions to module dedup.compression | Helmut Grohne | |
2013-02-21 | do not track byted compiled python files | Helmut Grohne | |
2013-02-21 | move hashing functions to module dedup.hashing | Helmut Grohne | |