Clean reconstructed objects outside pack window #716

jfontan · 2018-01-11T17:09:20Z

Object walk reconstructs delta objects but these are not cleaned up
after they got out the pack window. Without this change all
reconstructed objects reside in memory.

restoreOriginal call is moved before calling Size(). Now we can not
guarantee that the object is already undeltified.

Object walk reconstructs delta objects but these are not cleaned up after they got out the pack window. Without this change all reconstructed objects reside in memory. restoreOriginal call is moved before calling Size(). Now we can not guarantee that the object is already undeltified. Signed-off-by: Javi Fontan <[email protected]>

erizocosmico · 2018-01-11T17:16:12Z

plumbing/format/packfile/delta_selector.go

@@ -261,6 +267,16 @@ func (dw *deltaSelector) walk(
 }

 func (dw *deltaSelector) tryToDeltify(indexMap map[plumbing.Hash]*deltaIndex, base, target *ObjectToPack) error {
+	// Original object might not be present if we're reusing a delta, so we
+	// ensure it is restored.
+	if err := dw.restoreOriginal(target); err != nil {


isn't this making it actually slower than before (even if it uses way less memory)?

I've just run borges pack with repository https://github.com/numpy/numpy (downloaded locally) and the results are quite close.

Original version:

203.83user 12.34system 2:28.80elapsed 2.4 Gb RAM

204.40user 12.27system 2:29.30elapsed 2.4 Gb RAM

With proposed changes:

217.82user 11.97system 2:25.72elapsed 1 Gb RAM

220.06user 11.81system 2:26.28elapsed 1 Gb RAM

It seems that it's a bit slower (user time is bigger) but it spends more system time. Maybe heap allocation.

erizocosmico · 2018-01-11T19:23:53Z

Then lgtm. That little slowdown is a nice tradeoff for half the memory

mcuadros · 2018-01-13T02:32:56Z

Can we test this with more diverse repository? Not only with the numpy repository?

jfontan · 2018-01-15T09:59:16Z

I've did new tests with the following repos:

cangallo: small repository (packfile 93KiB), https://github.com/jfontan/cangallo
octoprint-tft: small repository (packfile 3.1MiB), https://github.com/mcuadros/OctoPrint-TFT
upsilon: small repository, https://github.com/upsilonproject/upsilon-common
numpy: average repository, https://github.com/numpy/numpy
tensorflow, average repository, https://github.com/tensorflow/tensorflow
bismuth, some files are 100Mb in size, https://github.com/hclivess/Bismuth

The times and memory are only from the push action from local to a new repository. Each test is executed twice and the smaller value for time and memory is selected.

repository	master	fix memory
cangallo	145ms 3 MiB	151ms 2.6MiB
octoprint-tft	4s 32.6MiB	3.8s 33.2MiB
upsilon	54.2s 387MiB	53.4s 387MiB
numpy	1m25s 1.9GiB	1m19s 0.65GiB
tensorflow	3m16s 3.9GiB	2m59s 1.28GiB
bismuth	12m16s 17.7GiB	10m37s 3.13GiB

The code to do the benchmark is here: https://gist.github.com/jfontan/42fbfe0761e5280012d285c1a4290f5d

erizocosmico · 2018-01-15T14:04:23Z

So it's actually faster? And uses half the memory? Nice, great job!

mcuadros · 2018-01-15T15:11:33Z

Networking is envolved, so speed number aren't reliable

jfontan · 2018-01-15T15:25:16Z

I've just updated the table with two small repos.

jfontan requested review from ajnavarro, erizocosmico and mcuadros January 11, 2018 17:09

erizocosmico reviewed Jan 11, 2018

View reviewed changes

ajnavarro approved these changes Jan 12, 2018

View reviewed changes

erizocosmico approved these changes Jan 12, 2018

View reviewed changes

mcuadros merged commit 861e399 into src-d:master Jan 15, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clean reconstructed objects outside pack window #716

Clean reconstructed objects outside pack window #716

Uh oh!

jfontan commented Jan 11, 2018

Uh oh!

erizocosmico Jan 11, 2018

Uh oh!

jfontan Jan 11, 2018

Uh oh!

erizocosmico commented Jan 11, 2018 via email •

edited by mcuadros

Loading

Uh oh!

mcuadros commented Jan 13, 2018

Uh oh!

jfontan commented Jan 15, 2018 •

edited

Loading

Uh oh!

erizocosmico commented Jan 15, 2018

Uh oh!

mcuadros commented Jan 15, 2018

Uh oh!

jfontan commented Jan 15, 2018

Uh oh!

Uh oh!

Clean reconstructed objects outside pack window #716

Clean reconstructed objects outside pack window #716

Uh oh!

Conversation

jfontan commented Jan 11, 2018

Uh oh!

erizocosmico Jan 11, 2018

Choose a reason for hiding this comment

Uh oh!

jfontan Jan 11, 2018

Choose a reason for hiding this comment

Uh oh!

erizocosmico commented Jan 11, 2018 via email • edited by mcuadros Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mcuadros commented Jan 13, 2018

Uh oh!

jfontan commented Jan 15, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

erizocosmico commented Jan 15, 2018

Uh oh!

mcuadros commented Jan 15, 2018

Uh oh!

jfontan commented Jan 15, 2018

Uh oh!

Uh oh!

erizocosmico commented Jan 11, 2018 via email •

edited by mcuadros

Loading

jfontan commented Jan 15, 2018 •

edited

Loading