Age  Commit message  Author
2013-07-03  README: fix example query for hash BLOBs  [blobhash]  Helmut Grohne
Otherwise the sqlite tool just outputs them as binary to the terminal.
2013-07-03  store hash values as sqlite BLOB  Helmut Grohne
They were previously hex encoded, so this should cut the space consumed by hashes in half. A first benchmark indicates that the savings in database size are in the order of 30%.
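The halving can be illustrated with a minimal sketch. The table and column names here are hypothetical and not taken from the project's schema; only the hex-versus-raw size relation and SQLite's built-in `hex()` and `length()` functions are relied upon. The last query also shows one way to get readable output for BLOB hashes instead of raw binary.

```python
import hashlib
import sqlite3

# A sha512 digest is 64 raw bytes, or 128 characters when hex encoded.
digest = hashlib.sha512(b"example file content").digest()
hex_digest = digest.hex()

conn = sqlite3.connect(":memory:")
# Hypothetical table for illustration; the real schema may differ.
conn.execute("CREATE TABLE hash (function TEXT, hash BLOB)")
conn.execute("INSERT INTO hash VALUES (?, ?)", ("sha512", digest))

# length() on a BLOB counts bytes; hex() renders the BLOB as upper-case
# hex text, so lower() is applied to match Python's hex encoding.
stored_len, shown = conn.execute(
    "SELECT length(hash), lower(hex(hash)) FROM hash"
).fetchone()
```

The overall database shrinks by less than half because the file names, sizes and indexes are unaffected by the encoding of the hash column.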
2013-06-23  Merge branch yamlimport  Helmut Grohne
+ Way faster on multiple cores.
+ More reliable, because http connections do not time out when the db blocks.
- Way slower on a single core with a contended io path. No clue why.
Still, update_sharing.py makes up the bulk of processing time.
2013-06-19  webapp: fix hash example link after git upload  Helmut Grohne
The git binary changed and so did its hash. Choosing a more stable example now: The GPL-3.
2013-06-11  autoimport: don't fork for readyaml  Helmut Grohne
This appears to be a huge performance boost.
2013-06-11  autoimport: support processing individual files  Helmut Grohne
This gets back the original functionality of importpkg.py.
2013-06-10  split the import phase to a yaml stream  Helmut Grohne
importpkg.py now emits a yaml stream instead of updating the database. The actual updating now happens in readyaml.py. In this process autoimport.py was significantly reworked to import packages in parallel.
2013-05-27  dedup.image: img.convert can also raise that crazy stuff  Helmut Grohne
2013-05-09  webapp: declare html5 and utf-8  Helmut Grohne
2013-05-09  webapp: enrich comparison page with version info  Helmut Grohne
2013-05-08  fix attribution of logo  Helmut Grohne
I remembered the wrong name. The logo was made by Sune Vuorela.
2013-05-05  webapp: markup error in /source template  Helmut Grohne
2013-05-05  webapp: validator complained about <link> with sizes  Helmut Grohne
2013-05-05  webapp: reference favicon from base.html  Helmut Grohne
2013-05-05  added favicon.ico  Helmut Grohne
Authored: Cyril Brulebois
2013-05-02  webapp: use jinja's filesizeformat  Helmut Grohne
Except it doesn't work, so replace it with our version. At least we might be able to drop this code in a future update.
2013-05-02  webapp: reduce size of comparison output  Helmut Grohne
Only add rowspan when it carries a meaning.
2013-04-27  webapp: add a css class binary-package  Helmut Grohne
2013-04-25  webapp: total_size is None if num_files is 0  Helmut Grohne
2013-04-25  webapp: color filenames when hovering them  Helmut Grohne
2013-04-25  webapp: turn the <br> after filename into a style  Helmut Grohne
2013-04-25  move css to /style.css  Helmut Grohne
2013-04-25  webapp: make filenames css styleable  Helmut Grohne
2013-04-25  webapp: top-align fields in /compare pages  Helmut Grohne
Suggested by Paul Wise.
2013-04-25  fix markup in base.html  Helmut Grohne
2013-04-24  implement the /compare/pkg1/pkg2 page differently  Helmut Grohne
The original version had two major drawbacks:
1) The SQL query used would cause a btree sort, so the time waiting for the first output was rather long.
2) For packages with many equal files, the output would grow with O(n^2).
Thanks to Christine Grohne and Klaus Aehlig for their suggestions. The approach now groups files in package1 by their main hash value (sha512). It also now does manually some work that SQL was designed to do. To speed up page generation, a new caching table was added that identifies which files have corresponding shared files.
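The grouping idea can be sketched as follows. This is not the webapp's code: the function names and the input shape (pairs of filename and hash) are assumptions made for illustration, and this sketch groups both packages, whereas the commit only states that files in package1 are grouped.

```python
from collections import defaultdict


def group_by_hash(files):
    """Map each hash value to the list of filenames carrying it.

    `files` is an iterable of (filename, hash) pairs (assumed shape).
    """
    groups = defaultdict(list)
    for filename, digest in files:
        groups[digest].append(filename)
    return dict(groups)


def compare(files1, files2):
    """Return one row per shared hash instead of one row per file pair."""
    groups1 = group_by_hash(files1)
    groups2 = group_by_hash(files2)
    rows = []
    for digest, names1 in groups1.items():
        names2 = groups2.get(digest)
        if names2:
            rows.append((sorted(names1), sorted(names2)))
    return rows


# Two equal files on each side yield a single row listing four names,
# where a pairwise join would have produced four rows.
example_rows = compare(
    [("a", "h1"), ("b", "h1"), ("c", "h2")],
    [("x", "h1"), ("y", "h1"), ("z", "h3")],
)
```

With n equal files in each package, the grouped output lists 2n names in one row rather than n^2 pairs.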
2013-04-14  webapp: added some useful notes  Helmut Grohne
2013-04-13  base.html: add link to wiki.debian.org  Helmut Grohne
2013-04-08  README: improve query after schemachange  Helmut Grohne
2013-03-26  webapp: fix problem from the previous merge  Helmut Grohne
2013-03-26  Merge branch schemachange  Helmut Grohne
2013-03-20  webapp: report correct sizes  Helmut Grohne
2013-03-20  webapp: remove broken assert  Helmut Grohne
Fails on long inputs.
2013-03-18  dedup.image: mask errors from PIL  Helmut Grohne
2013-03-12  dedup.arreader: missing bytes marker  Helmut Grohne
2013-03-12  move ArReader from importpkg to dedup.arreader  Helmut Grohne
Also document it.
2013-03-10  README: update queries to match content table split  Helmut Grohne
2013-03-09  split content table to a hash table  Helmut Grohne
In the old content table (package, filename, size) would be the same for multiple hash functions. Now the schema represents that each file has precisely one size, but multiple hashes.
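A minimal sketch of such a split, under assumed names: the columns `id` and `cid` and the hash function labels below are placeholders, not the project's actual schema. Each file gets one `content` row with its size, and one `hash` row per hash function.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: one row per file, plus one row per (file, function).
conn.executescript("""
CREATE TABLE content (
    id INTEGER PRIMARY KEY,
    package TEXT NOT NULL,
    filename TEXT NOT NULL,
    size INTEGER NOT NULL
);
CREATE TABLE hash (
    cid INTEGER NOT NULL REFERENCES content(id),
    function TEXT NOT NULL,
    hash TEXT NOT NULL
);
""")

cur = conn.execute(
    "INSERT INTO content (package, filename, size) VALUES (?, ?, ?)",
    ("pkg", "./usr/share/doc/pkg/copyright", 1234),
)
cid = cur.lastrowid
# Placeholder function names and hash values.
conn.executemany(
    "INSERT INTO hash (cid, function, hash) VALUES (?, ?, ?)",
    [(cid, "sha512", "aa"), (cid, "gzip_sha512", "bb")],
)

# The size is stored once and joined to every hash of the file.
rows = conn.execute(
    "SELECT c.filename, c.size, h.function FROM content c "
    "JOIN hash h ON h.cid = c.id ORDER BY h.function"
).fetchall()
content_count = conn.execute("SELECT count(*) FROM content").fetchone()[0]
```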
2013-03-09  webapp: drop unused function compute_sharedstats  Helmut Grohne
The sharing table works great and I don't want to adapt it for the next step in the schema change.
2013-03-07  use "ON DELETE CASCADE" clauses  Helmut Grohne
2013-03-07  enable enforcing foreign keys  Helmut Grohne
2013-03-07  schema.sql: remove unsatisfiable foreign key  Helmut Grohne
In the dependency table we will insert dependencies on packages which are not tracked. This happens during initial import and for virtual packages. Therefore the "required" column cannot be a foreign key.
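The interplay of enforced foreign keys, cascading deletes and the unconstrained "required" column can be sketched like this. The table layout is an assumption for illustration and does not reproduce schema.sql; only the `dependency` table and its `required` column are named in the commit. SQLite enforces foreign keys only after `PRAGMA foreign_keys = ON` has been issued on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Enforcement is off by default and must be enabled per connection.
conn.execute("PRAGMA foreign_keys = ON")
# Hypothetical, simplified schema.
conn.executescript("""
CREATE TABLE package (package TEXT PRIMARY KEY);
CREATE TABLE content (
    id INTEGER PRIMARY KEY,
    package TEXT NOT NULL REFERENCES package(package) ON DELETE CASCADE,
    filename TEXT NOT NULL
);
CREATE TABLE dependency (
    package TEXT NOT NULL REFERENCES package(package) ON DELETE CASCADE,
    required TEXT NOT NULL  -- deliberately not a foreign key
);
""")

conn.execute("INSERT INTO package VALUES ('foo')")
conn.execute("INSERT INTO content (package, filename) VALUES ('foo', './bin/foo')")
# A dependency on an untracked (for example virtual) package is accepted.
conn.execute("INSERT INTO dependency VALUES ('foo', 'mail-transport-agent')")

# A file belonging to an untracked package violates the foreign key.
try:
    conn.execute("INSERT INTO content (package, filename) VALUES ('untracked', './x')")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

# Deleting the package removes its files and dependencies as well.
conn.execute("DELETE FROM package WHERE package = 'foo'")
remaining = conn.execute(
    "SELECT (SELECT count(*) FROM content), (SELECT count(*) FROM dependency)"
).fetchone()
```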
2013-03-07  schema.sql: annotate foreign keys of sharing  Helmut Grohne
2013-03-07  integrate the source table into the package table  Helmut Grohne
2013-03-07  README: explain queries  Helmut Grohne
2013-03-06  README: added interesting query  Helmut Grohne
2013-03-05  webapp: added /source/<pkg> page  Helmut Grohne
2013-03-05  webapp: helper function function_combination  Helmut Grohne
2013-03-05  importpkg: source header may contain a version  Helmut Grohne
2013-03-04  webapp: fix index template  Helmut Grohne
Apparently not all browsers understand <a ... /> in all rendering modes.