Age | Commit message | Author |
|
To facilitate this, importpkg.py decodes filenames using iso-8859-1.
While this is not the encoding used for most filenames, decoding with
it never fails. Conversely, readyaml.py encodes to iso-8859-1 to undo
the above effect.
webapp.py cannot currently cope with the above change and is broken by
this commit.
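The round trip described above can be illustrated with a minimal sketch. The helper names `decode_filename` and `encode_filename` and the sample bytes are hypothetical and are not taken from importpkg.py or readyaml.py; only the choice of iso-8859-1 comes from the commit message.

```python
def decode_filename(raw: bytes) -> str:
    # ISO-8859-1 maps every byte 0x00..0xFF to the code point with the
    # same value, so this decode cannot raise for any input.
    return raw.decode("iso-8859-1")


def encode_filename(text: str) -> bytes:
    # Inverse step: recovers the original bytes, provided the text was
    # produced by decode_filename and not modified in between.
    return text.encode("iso-8859-1")


# Hypothetical filename: the trailing 0xFF byte makes it invalid UTF-8.
raw = b"caf\xc3\xa9-\xff.deb"

try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False

decoded = decode_filename(raw)
restored = encode_filename(decoded)
```

The decoded string is not meant for display (multi-byte UTF-8 sequences show up as separate characters); it is only a lossless carrier for the original bytes.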
|
|
|
|
This should reduce the query bandwidth to the rdbms.
|
|
This already worked quite well for package.id. On a test data set of 5%
of the full size, this transformation reduces the database size by about 4%.
|
|
One approach to improve performance is to reduce the database size. A
package name takes up 15 bytes on average. A package number takes up
two bytes. Multiply that difference by the number of references and
the saving should be noticeable. A small test set shows a reduction of 10%.
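As a rough sketch of the idea, the following uses an in-memory SQLite database. The `content` table, its columns, the sample rows, and the helper `package_id` are assumptions made for illustration; only the `package.id` column and the 15-byte and 2-byte figures come from the commit messages.

```python
import sqlite3

AVG_NAME_BYTES = 15  # average package name size, from the commit message
ID_BYTES = 2         # size of a package number, from the commit message


def estimated_savings(references: int) -> int:
    # Bytes saved when every reference stores a number instead of a name.
    return (AVG_NAME_BYTES - ID_BYTES) * references


conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE package (id INTEGER PRIMARY KEY, name TEXT UNIQUE NOT NULL);
-- Hypothetical referencing table: stores the number, not the name.
CREATE TABLE content (
    package INTEGER NOT NULL REFERENCES package(id),
    filename TEXT NOT NULL
);
""")


def package_id(conn: sqlite3.Connection, name: str) -> int:
    # Insert the name once, then look up its number for all references.
    conn.execute("INSERT OR IGNORE INTO package (name) VALUES (?)", (name,))
    row = conn.execute("SELECT id FROM package WHERE name = ?", (name,)).fetchone()
    return row[0]


samples = [
    ("libexample1", "usr/lib/libexample.so.1"),
    ("libexample1", "usr/share/doc/libexample1/copyright"),
    ("example-doc", "usr/share/doc/example-doc/index.html"),
]
for name, filename in samples:
    conn.execute(
        "INSERT INTO content (package, filename) VALUES (?, ?)",
        (package_id(conn, name), filename),
    )

# The name is still available through a join when it is needed.
rows = conn.execute(
    "SELECT p.name, c.filename FROM content c "
    "JOIN package p ON p.id = c.package ORDER BY c.filename"
).fetchall()
```

With these figures each reference saves 13 bytes, so `estimated_savings(1_000_000)` gives 13,000,000 bytes before any index or page overhead is considered.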
|
|
This appears to be a huge performance boost.
|
|
importpkg.py now emits a yaml stream instead of updating the database.
The actual updating now happens in readyaml.py. In the process,
autoimport.py was significantly reworked to import packages in parallel.
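The split between extraction and database update might look roughly like the sketch below. It is not the code from autoimport.py: the functions `extract_package`, `apply_document`, and `import_all` are hypothetical, a thread pool stands in for whatever parallelism the real script uses, and `json` from the standard library stands in for yaml so the example has no third-party dependency.

```python
import json
import sqlite3
from concurrent.futures import ThreadPoolExecutor


def extract_package(name: str) -> str:
    # Stand-in for the extraction step: produce a serialized document
    # describing the package without touching the database.
    return json.dumps({"name": name, "files": [f"usr/share/doc/{name}/copyright"]})


def apply_document(conn: sqlite3.Connection, document: str) -> None:
    # Stand-in for the update step: the only code that writes to the database.
    data = json.loads(document)
    conn.executemany(
        "INSERT INTO content (package, filename) VALUES (?, ?)",
        [(data["name"], filename) for filename in data["files"]],
    )


def import_all(conn: sqlite3.Connection, names: list[str], workers: int = 4) -> None:
    # Extraction runs in parallel; documents are applied one at a time in
    # the calling thread, so the connection is never shared across threads.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for document in pool.map(extract_package, names):
            apply_document(conn, document)
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE content (package TEXT, filename TEXT)")
import_all(conn, ["pkg-a", "pkg-b", "pkg-c"])
count = conn.execute("SELECT COUNT(*) FROM content").fetchone()[0]
```

Keeping the writer single-threaded is one way to let the expensive extraction run concurrently while database access stays serialized.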
|