Skip to content

Commit 4479292

Browse files
committed
Merge branch 'ds/maintenance-part-3' into seen
Parts of "git maintenance" to ease writing crontab entries (and other scheduling system configuration) for it. cf. <[email protected]> * ds/maintenance-part-3: maintenance: add troubleshooting guide to docs maintenance: use default schedule if not configured maintenance: add start/stop subcommands maintenance: add [un]register subcommands for-each-repo: run subcommands on configured repos maintenance: add --schedule option and config maintenance: optionally skip --auto process
2 parents edbc0bf + 5f1c2c2 commit 4479292

17 files changed

+705
-6
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -68,6 +68,7 @@
6868
/git-filter-branch
6969
/git-fmt-merge-msg
7070
/git-for-each-ref
71+
/git-for-each-repo
7172
/git-format-patch
7273
/git-fsck
7374
/git-fsck-objects

Documentation/config/maintenance.txt

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,20 @@
1+
maintenance.auto::
2+
This boolean config option controls whether some commands run
3+
`git maintenance run --auto` after doing their normal work. Defaults
4+
to true.
5+
16
maintenance.<task>.enabled::
27
This boolean config option controls whether the maintenance task
38
with name `<task>` is run when no `--task` option is specified to
49
`git maintenance run`. These config values are ignored if a
510
`--task` option exists. By default, only `maintenance.gc.enabled`
611
is true.
712

13+
maintenance.<task>.schedule::
14+
This config option controls whether or not the given `<task>` runs
15+
during a `git maintenance run --schedule=<frequency>` command. The
16+
value must be one of "hourly", "daily", or "weekly".
17+
818
maintenance.commit-graph.auto::
919
This integer config option controls how often the `commit-graph` task
1020
should be run as part of `git maintenance run --auto`. If zero, then

Documentation/git-for-each-repo.txt

Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
git-for-each-repo(1)
2+
====================
3+
4+
NAME
5+
----
6+
git-for-each-repo - Run a Git command on a list of repositories
7+
8+
9+
SYNOPSIS
10+
--------
11+
[verse]
12+
'git for-each-repo' --config=<config> [--] <arguments>
13+
14+
15+
DESCRIPTION
16+
-----------
17+
Run a Git command on a list of repositories. The arguments after the
18+
known options or `--` indicator are used as the arguments for the Git
19+
subprocess.
20+
21+
THIS COMMAND IS EXPERIMENTAL. THE BEHAVIOR MAY CHANGE.
22+
23+
For example, we could run maintenance on each of a list of repositories
24+
stored in a `maintenance.repo` config variable using
25+
26+
-------------
27+
git for-each-repo --config=maintenance.repo maintenance run
28+
-------------
29+
30+
This will run `git -C <repo> maintenance run` for each value `<repo>`
31+
in the multi-valued config variable `maintenance.repo`.
32+
33+
34+
OPTIONS
35+
-------
36+
--config=<config>::
37+
Use the given config variable as a multi-valued list storing
38+
absolute path names. Iterate on that list of paths to run
39+
the given arguments.
40+
+
41+
These config values are loaded from system, global, and local Git config,
42+
as available. If `git for-each-repo` is run in a directory that is not a
43+
Git repository, then only the system and global config is used.
44+
45+
46+
SUBPROCESS BEHAVIOR
47+
-------------------
48+
49+
If any `git -C <repo> <arguments>` subprocess returns a non-zero exit code,
50+
then the `git for-each-repo` process returns that exit code without running
51+
more subprocesses.
52+
53+
Each `git -C <repo> <arguments>` subprocess inherits the standard file
54+
descriptors `stdin`, `stdout`, and `stderr`.
55+
56+
57+
GIT
58+
---
59+
Part of the linkgit:git[1] suite

Documentation/git-maintenance.txt

Lines changed: 96 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,13 +29,53 @@ Git repository.
2929
SUBCOMMANDS
3030
-----------
3131

32+
register::
33+
Initialize Git config values so any scheduled maintenance will
34+
start running on this repository. This adds the repository to the
35+
`maintenance.repo` config variable in the current user's global
36+
config and enables some recommended configuration values for
37+
`maintenance.<task>.schedule`. The tasks that are enabled are safe
38+
for running in the background without disrupting foreground
39+
processes.
40+
+
41+
If your repository has no `maintenance.<task>.schedule` configuration
42+
values set, then Git will use a recommended default schedule that performs
43+
background maintenance that will not interrupt foreground commands. The
44+
default schedule is as follows:
45+
+
46+
* `gc`: disabled.
47+
* `commit-graph`: hourly.
48+
* `prefetch`: hourly.
49+
* `loose-objects`: daily.
50+
* `incremental-repack`: daily.
51+
+
52+
`git maintenance register` will also disable foreground maintenance by
53+
setting `maintenance.auto = false` in the current repository. This config
54+
setting will remain after a `git maintenance unregister` command.
55+
3256
run::
3357
Run one or more maintenance tasks. If one or more `--task` options
3458
are specified, then those tasks are run in that order. Otherwise,
3559
the tasks are determined by which `maintenance.<task>.enabled`
3660
config options are true. By default, only `maintenance.gc.enabled`
3761
is true.
3862

63+
start::
64+
Start running maintenance on the current repository. This performs
65+
the same config updates as the `register` subcommand, then updates
66+
the background scheduler to run `git maintenance run --scheduled`
67+
on an hourly basis.
68+
69+
stop::
70+
Halt the background maintenance schedule. The current repository
71+
is not removed from the list of maintained repositories, in case
72+
the background maintenance is restarted later.
73+
74+
unregister::
75+
Remove the current repository from background maintenance. This
76+
only removes the repository from the configured list. It does not
77+
stop the background maintenance processes from running.
78+
3979
TASKS
4080
-----
4181

@@ -110,7 +150,18 @@ OPTIONS
110150
only if certain thresholds are met. For example, the `gc` task
111151
runs when the number of loose objects exceeds the number stored
112152
in the `gc.auto` config setting, or when the number of pack-files
113-
exceeds the `gc.autoPackLimit` config setting.
153+
exceeds the `gc.autoPackLimit` config setting. Not compatible with
154+
the `--schedule` option.
155+
156+
--schedule::
157+
When combined with the `run` subcommand, run maintenance tasks
158+
only if certain time conditions are met, as specified by the
159+
`maintenance.<task>.schedule` config value for each `<task>`.
160+
This config value specifies a number of seconds since the last
161+
time that task ran, according to the `maintenance.<task>.lastRun`
162+
config value. The tasks that are tested are those provided by
163+
the `--task=<task>` option(s) or those with
164+
`maintenance.<task>.enabled` set to true.
114165

115166
--quiet::
116167
Do not report progress or other information over `stderr`.
@@ -122,6 +173,50 @@ OPTIONS
122173
`maintenance.<task>.enabled` configured as `true` are considered.
123174
See the 'TASKS' section for the list of accepted `<task>` values.
124175

176+
177+
TROUBLESHOOTING
178+
---------------
179+
The `git maintenance` command is designed to simplify the repository
180+
maintenance patterns while minimizing user wait time during Git commands.
181+
A variety of configuration options are available to allow customizing this
182+
process. The default maintenance options focus on operations that complete
183+
quickly, even on large repositories.
184+
185+
Users may find some cases where scheduled maintenance tasks do not run as
186+
frequently as intended. Each `git maintenance run` command takes a lock on
187+
the repository's object database, and this prevents other concurrent
188+
`git maintenance run` commands from running on the same repository. Without
189+
this safeguard, competing processes could leave the repository in an
190+
unpredictable state.
191+
192+
The background maintenance schedule runs `git maintenance run` processes
193+
on an hourly basis. Each run executes the "hourly" tasks. At midnight,
194+
that process also executes the "daily" tasks. At midnight on the first day
195+
of the week, that process also executes the "weekly" tasks. A single
196+
process iterates over each registered repository, performing the scheduled
197+
tasks for that frequency. Depending on the number of registered
198+
repositories and their sizes, this process may take longer than an hour.
199+
In this case, multiple `git maintenance run` commands may run on the same
200+
repository at the same time, colliding on the object database lock. This
201+
results in one of the two tasks not running.
202+
203+
If you find that some maintenance windows are taking longer than one hour
204+
to complete, then consider reducing the complexity of your maintenance
205+
tasks. For example, the `gc` task is much slower than the
206+
`incremental-repack` task. However, this comes at a cost of a slightly
207+
larger object database. Consider moving more expensive tasks to be run
208+
less frequently.
209+
210+
Expert users may consider scheduling their own maintenance tasks using a
211+
different schedule than is available through `git maintenance start` and
212+
Git configuration options. These users should be aware of the object
213+
database lock and how concurrent `git maintenance run` commands behave.
214+
Further, the `git gc` command should not be combined with
215+
`git maintenance run` commands. `git gc` modifies the object database
216+
but does not take the lock in the same way as `git maintenance run`. If
217+
possible, use `git maintenance run --task=gc` instead of `git gc`.
218+
219+
125220
GIT
126221
---
127222
Part of the linkgit:git[1] suite

Makefile

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -694,6 +694,7 @@ TEST_BUILTINS_OBJS += test-advise.o
694694
TEST_BUILTINS_OBJS += test-bloom.o
695695
TEST_BUILTINS_OBJS += test-chmtime.o
696696
TEST_BUILTINS_OBJS += test-config.o
697+
TEST_BUILTINS_OBJS += test-crontab.o
697698
TEST_BUILTINS_OBJS += test-ctype.o
698699
TEST_BUILTINS_OBJS += test-date.o
699700
TEST_BUILTINS_OBJS += test-delta.o
@@ -1093,6 +1094,7 @@ BUILTIN_OBJS += builtin/fetch-pack.o
10931094
BUILTIN_OBJS += builtin/fetch.o
10941095
BUILTIN_OBJS += builtin/fmt-merge-msg.o
10951096
BUILTIN_OBJS += builtin/for-each-ref.o
1097+
BUILTIN_OBJS += builtin/for-each-repo.o
10961098
BUILTIN_OBJS += builtin/fsck.o
10971099
BUILTIN_OBJS += builtin/gc.o
10981100
BUILTIN_OBJS += builtin/get-tar-commit-id.o

builtin.h

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -156,6 +156,7 @@ int cmd_fetch(int argc, const char **argv, const char *prefix);
156156
int cmd_fetch_pack(int argc, const char **argv, const char *prefix);
157157
int cmd_fmt_merge_msg(int argc, const char **argv, const char *prefix);
158158
int cmd_for_each_ref(int argc, const char **argv, const char *prefix);
159+
int cmd_for_each_repo(int argc, const char **argv, const char *prefix);
159160
int cmd_format_patch(int argc, const char **argv, const char *prefix);
160161
int cmd_fsck(int argc, const char **argv, const char *prefix);
161162
int cmd_gc(int argc, const char **argv, const char *prefix);

builtin/for-each-repo.c

Lines changed: 58 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,58 @@
1+
#include "cache.h"
2+
#include "config.h"
3+
#include "builtin.h"
4+
#include "parse-options.h"
5+
#include "run-command.h"
6+
#include "string-list.h"
7+
8+
static const char * const for_each_repo_usage[] = {
9+
N_("git for-each-repo --config=<config> <command-args>"),
10+
NULL
11+
};
12+
13+
static int run_command_on_repo(const char *path,
14+
void *cbdata)
15+
{
16+
int i;
17+
struct child_process child = CHILD_PROCESS_INIT;
18+
struct strvec *args = (struct strvec *)cbdata;
19+
20+
child.git_cmd = 1;
21+
strvec_pushl(&child.args, "-C", path, NULL);
22+
23+
for (i = 0; i < args->nr; i++)
24+
strvec_push(&child.args, args->v[i]);
25+
26+
return run_command(&child);
27+
}
28+
29+
int cmd_for_each_repo(int argc, const char **argv, const char *prefix)
30+
{
31+
static const char *config_key = NULL;
32+
int i, result = 0;
33+
const struct string_list *values;
34+
struct strvec args = STRVEC_INIT;
35+
36+
const struct option options[] = {
37+
OPT_STRING(0, "config", &config_key, N_("config"),
38+
N_("config key storing a list of repository paths")),
39+
OPT_END()
40+
};
41+
42+
argc = parse_options(argc, argv, prefix, options, for_each_repo_usage,
43+
PARSE_OPT_STOP_AT_NON_OPTION);
44+
45+
if (!config_key)
46+
die(_("missing --config=<config>"));
47+
48+
for (i = 0; i < argc; i++)
49+
strvec_push(&args, argv[i]);
50+
51+
values = repo_config_get_value_multi(the_repository,
52+
config_key);
53+
54+
for (i = 0; !result && i < values->nr; i++)
55+
result = run_command_on_repo(values->items[i].string, &args);
56+
57+
return result;
58+
}

0 commit comments

Comments
 (0)