Skip to content

Commit 76dc20b

Browse files
committed
Merge branch 'ds/maintenance-part-3' into seen
Parts of "git maintenance" to ease writing crontab entries (and other scheduling system configuration) for it. * ds/maintenance-part-3: maintenance: add troubleshooting guide to docs maintenance: recommended schedule in register/start maintenance: add start/stop subcommands maintenance: add [un]register subcommands for-each-repo: run subcommands on configured repos maintenance: add --schedule option and config maintenance: optionally skip --auto process
2 parents 22610a3 + fd9df56 commit 76dc20b

17 files changed

+697
-6
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -67,6 +67,7 @@
6767
/git-filter-branch
6868
/git-fmt-merge-msg
6969
/git-for-each-ref
70+
/git-for-each-repo
7071
/git-format-patch
7172
/git-fsck
7273
/git-fsck-objects

Documentation/config/maintenance.txt

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,20 @@
1+
maintenance.auto::
2+
This boolean config option controls whether some commands run
3+
`git maintenance run --auto` after doing their normal work. Defaults
4+
to true.
5+
16
maintenance.<task>.enabled::
27
This boolean config option controls whether the maintenance task
38
with name `<task>` is run when no `--task` option is specified to
49
`git maintenance run`. These config values are ignored if a
510
`--task` option exists. By default, only `maintenance.gc.enabled`
611
is true.
712

13+
maintenance.<task>.schedule::
14+
This config option controls whether or not the given `<task>` runs
15+
during a `git maintenance run --schedule=<frequency>` command. The
16+
value must be one of "hourly", "daily", or "weekly".
17+
818
maintenance.commit-graph.auto::
919
This integer config option controls how often the `commit-graph` task
1020
should be run as part of `git maintenance run --auto`. If zero, then

Documentation/git-for-each-repo.txt

Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
git-for-each-repo(1)
2+
====================
3+
4+
NAME
5+
----
6+
git-for-each-repo - Run a Git command on a list of repositories
7+
8+
9+
SYNOPSIS
10+
--------
11+
[verse]
12+
'git for-each-repo' --config=<config> [--] <arguments>
13+
14+
15+
DESCRIPTION
16+
-----------
17+
Run a Git command on a list of repositories. The arguments after the
18+
known options or `--` indicator are used as the arguments for the Git
19+
subprocess.
20+
21+
THIS COMMAND IS EXPERIMENTAL. THE BEHAVIOR MAY CHANGE.
22+
23+
For example, we could run maintenance on each of a list of repositories
24+
stored in a `maintenance.repo` config variable using
25+
26+
-------------
27+
git for-each-repo --config=maintenance.repo maintenance run
28+
-------------
29+
30+
This will run `git -C <repo> maintenance run` for each value `<repo>`
31+
in the multi-valued config variable `maintenance.repo`.
32+
33+
34+
OPTIONS
35+
-------
36+
--config=<config>::
37+
Use the given config variable as a multi-valued list storing
38+
absolute path names. Iterate on that list of paths to run
39+
the given arguments.
40+
+
41+
These config values are loaded from system, global, and local Git config,
42+
as available. If `git for-each-repo` is run in a directory that is not a
43+
Git repository, then only the system and global config is used.
44+
45+
46+
SUBPROCESS BEHAVIOR
47+
-------------------
48+
49+
If any `git -C <repo> <arguments>` subprocess returns a non-zero exit code,
50+
then the `git for-each-repo` process returns that exit code without running
51+
more subprocesses.
52+
53+
Each `git -C <repo> <arguments>` subprocess inherits the standard file
54+
descriptors `stdin`, `stdout`, and `stderr`.
55+
56+
57+
GIT
58+
---
59+
Part of the linkgit:git[1] suite

Documentation/git-maintenance.txt

Lines changed: 87 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,13 +29,44 @@ Git repository.
2929
SUBCOMMANDS
3030
-----------
3131

32+
register::
33+
Initialize Git config values so any scheduled maintenance will
34+
start running on this repository. This adds the repository to the
35+
`maintenance.repo` config variable in the current user's global
36+
config and enables some recommended configuration values for
37+
`maintenance.<task>.schedule`. The tasks that are enabled are safe
38+
for running in the background without disrupting foreground
39+
processes.
40+
+
41+
If your repository has no 'maintenance.<task>.schedule' configuration
42+
values set, then Git will set configuration values to some recommended
43+
settings. These settings disable foreground maintenance while performing
44+
maintenance tasks in the background that will not interrupt foreground Git
45+
operations.
46+
3247
run::
3348
Run one or more maintenance tasks. If one or more `--task` options
3449
are specified, then those tasks are run in that order. Otherwise,
3550
the tasks are determined by which `maintenance.<task>.enabled`
3651
config options are true. By default, only `maintenance.gc.enabled`
3752
is true.
3853

54+
start::
55+
Start running maintenance on the current repository. This performs
56+
the same config updates as the `register` subcommand, then updates
57+
the background scheduler to run `git maintenance run --scheduled`
58+
on an hourly basis.
59+
60+
stop::
61+
Halt the background maintenance schedule. The current repository
62+
is not removed from the list of maintained repositories, in case
63+
the background maintenance is restarted later.
64+
65+
unregister::
66+
Remove the current repository from background maintenance. This
67+
only removes the repository from the configured list. It does not
68+
stop the background maintenance processes from running.
69+
3970
TASKS
4071
-----
4172

@@ -110,7 +141,18 @@ OPTIONS
110141
only if certain thresholds are met. For example, the `gc` task
111142
runs when the number of loose objects exceeds the number stored
112143
in the `gc.auto` config setting, or when the number of pack-files
113-
exceeds the `gc.autoPackLimit` config setting.
144+
exceeds the `gc.autoPackLimit` config setting. Not compatible with
145+
the `--schedule` option.
146+
147+
--schedule::
148+
When combined with the `run` subcommand, run maintenance tasks
149+
only if certain time conditions are met, as specified by the
150+
`maintenance.<task>.schedule` config value for each `<task>`.
151+
This config value specifies a number of seconds since the last
152+
time that task ran, according to the `maintenance.<task>.lastRun`
153+
config value. The tasks that are tested are those provided by
154+
the `--task=<task>` option(s) or those with
155+
`maintenance.<task>.enabled` set to true.
114156

115157
--quiet::
116158
Do not report progress or other information over `stderr`.
@@ -122,6 +164,50 @@ OPTIONS
122164
`maintenance.<task>.enabled` configured as `true` are considered.
123165
See the 'TASKS' section for the list of accepted `<task>` values.
124166

167+
168+
TROUBLESHOOTING
169+
---------------
170+
The `git maintenance` command is designed to simplify the repository
171+
maintenance patterns while minimizing user wait time during Git commands.
172+
A variety of configuration options are available to allow customizing this
173+
process. The default maintenance options focus on operations that complete
174+
quickly, even on large repositories.
175+
176+
Users may find some cases where scheduled maintenance tasks do not run as
177+
frequently as intended. Each `git maintenance run` command takes a lock on
178+
the repository's object database, and this prevents other concurrent
179+
`git maintenance run` commands from running on the same repository. Without
180+
this safeguard, competing processes could leave the repository in an
181+
unpredictable state.
182+
183+
The background maintenance schedule runs `git maintenance run` processes
184+
on an hourly basis. Each run executes the "hourly" tasks. At midnight,
185+
that process also executes the "daily" tasks. At midnight on the first day
186+
of the week, that process also executes the "weekly" tasks. A single
187+
process iterates over each registered repository, performing the scheduled
188+
tasks for that frequency. Depending on the number of registered
189+
repositories and their sizes, this process may take longer than an hour.
190+
In this case, multiple `git maintenance run` commands may run on the same
191+
repository at the same time, colliding on the object database lock. This
192+
results in one of the two tasks not running.
193+
194+
If you find that some maintenance windows are taking longer than one hour
195+
to complete, then consider reducing the complexity of your maintenance
196+
tasks. For example, the `gc` task is much slower than the
197+
`incremental-repack` task. However, this comes at a cost of a slightly
198+
larger object database. Consider moving more expensive tasks to be run
199+
less frequently.
200+
201+
Expert users may consider scheduling their own maintenance tasks using a
202+
different schedule than is available through `git maintenance start` and
203+
Git configuration options. These users should be aware of the object
204+
database lock and how concurrent `git maintenance run` commands behave.
205+
Further, the `git gc` command should not be combined with
206+
`git maintenance run` commands. `git gc` modifies the object database
207+
but does not take the lock in the same way as `git maintenance run`. If
208+
possible, use `git maintenance run --task=gc` instead of `git gc`.
209+
210+
125211
GIT
126212
---
127213
Part of the linkgit:git[1] suite

Makefile

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -694,6 +694,7 @@ TEST_BUILTINS_OBJS += test-advise.o
694694
TEST_BUILTINS_OBJS += test-bloom.o
695695
TEST_BUILTINS_OBJS += test-chmtime.o
696696
TEST_BUILTINS_OBJS += test-config.o
697+
TEST_BUILTINS_OBJS += test-crontab.o
697698
TEST_BUILTINS_OBJS += test-ctype.o
698699
TEST_BUILTINS_OBJS += test-date.o
699700
TEST_BUILTINS_OBJS += test-delta.o
@@ -1091,6 +1092,7 @@ BUILTIN_OBJS += builtin/fetch-pack.o
10911092
BUILTIN_OBJS += builtin/fetch.o
10921093
BUILTIN_OBJS += builtin/fmt-merge-msg.o
10931094
BUILTIN_OBJS += builtin/for-each-ref.o
1095+
BUILTIN_OBJS += builtin/for-each-repo.o
10941096
BUILTIN_OBJS += builtin/fsck.o
10951097
BUILTIN_OBJS += builtin/gc.o
10961098
BUILTIN_OBJS += builtin/get-tar-commit-id.o

builtin.h

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -155,6 +155,7 @@ int cmd_fetch(int argc, const char **argv, const char *prefix);
155155
int cmd_fetch_pack(int argc, const char **argv, const char *prefix);
156156
int cmd_fmt_merge_msg(int argc, const char **argv, const char *prefix);
157157
int cmd_for_each_ref(int argc, const char **argv, const char *prefix);
158+
int cmd_for_each_repo(int argc, const char **argv, const char *prefix);
158159
int cmd_format_patch(int argc, const char **argv, const char *prefix);
159160
int cmd_fsck(int argc, const char **argv, const char *prefix);
160161
int cmd_gc(int argc, const char **argv, const char *prefix);

builtin/for-each-repo.c

Lines changed: 58 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,58 @@
1+
#include "cache.h"
2+
#include "config.h"
3+
#include "builtin.h"
4+
#include "parse-options.h"
5+
#include "run-command.h"
6+
#include "string-list.h"
7+
8+
static const char * const for_each_repo_usage[] = {
9+
N_("git for-each-repo --config=<config> <command-args>"),
10+
NULL
11+
};
12+
13+
static int run_command_on_repo(const char *path,
14+
void *cbdata)
15+
{
16+
int i;
17+
struct child_process child = CHILD_PROCESS_INIT;
18+
struct strvec *args = (struct strvec *)cbdata;
19+
20+
child.git_cmd = 1;
21+
strvec_pushl(&child.args, "-C", path, NULL);
22+
23+
for (i = 0; i < args->nr; i++)
24+
strvec_push(&child.args, args->v[i]);
25+
26+
return run_command(&child);
27+
}
28+
29+
int cmd_for_each_repo(int argc, const char **argv, const char *prefix)
30+
{
31+
static const char *config_key = NULL;
32+
int i, result = 0;
33+
const struct string_list *values;
34+
struct strvec args = STRVEC_INIT;
35+
36+
const struct option options[] = {
37+
OPT_STRING(0, "config", &config_key, N_("config"),
38+
N_("config key storing a list of repository paths")),
39+
OPT_END()
40+
};
41+
42+
argc = parse_options(argc, argv, prefix, options, for_each_repo_usage,
43+
PARSE_OPT_STOP_AT_NON_OPTION);
44+
45+
if (!config_key)
46+
die(_("missing --config=<config>"));
47+
48+
for (i = 0; i < argc; i++)
49+
strvec_push(&args, argv[i]);
50+
51+
values = repo_config_get_value_multi(the_repository,
52+
config_key);
53+
54+
for (i = 0; !result && i < values->nr; i++)
55+
result = run_command_on_repo(values->items[i].string, &args);
56+
57+
return result;
58+
}

0 commit comments

Comments
 (0)