diff options
author | Helmut Grohne <helmut@subdivi.de> | 2013-04-24 20:56:46 +0200 |
---|---|---|
committer | Helmut Grohne <helmut@subdivi.de> | 2013-04-24 21:00:20 +0200 |
commit | 94eb867119af05639691ec7990dcf2d6a956dd86 (patch) | |
tree | 6f33e5f2badf1b19182c718f46614869047516cb /dedup/compression.py | |
parent | d2b83735a4810cec7bf7c0dd6fb521498f104435 (diff) | |
download | debian-dedup-94eb867119af05639691ec7990dcf2d6a956dd86.tar.gz |
implement the /compare/pkg1/pkg2 page differently
The original version had two major drawbacks:
1) The SQL query used would cause a btree sort, so the time waiting
for the first output was rather long.
2) For packages with many equal files, the output would grow with
O(n^2).
Thanks to the suggestions by Christine Grohne and Klaus Aehlig. The
approach now groups files in package1 by their main hash value (sha512).
It also does some work SQL was designed to solve manually now. To speed
up page generation a new caching table was added identifying which files
have corresponding shared files.
Diffstat (limited to 'dedup/compression.py')
0 files changed, 0 insertions, 0 deletions