On Sun, Dec 16, 2018 at 01:57:57PM -0800, [email protected] wrote:
> From: Nickolai Belakovski <[email protected]>
>
> Add an atom proving the path of the linked worktree where this ref is
> checked out, if it is checked out in any linked worktrees, and empty
> string otherwise.
I stumbled over the word "proving" here. Maybe "showing" would be more
clear?
> diff --git a/Documentation/git-for-each-ref.txt
> b/Documentation/git-for-each-ref.txt
> index 901faef1bf..9590f7beab 100644
> --- a/Documentation/git-for-each-ref.txt
> +++ b/Documentation/git-for-each-ref.txt
> @@ -209,6 +209,10 @@ symref::
> `:lstrip` and `:rstrip` options in the same way as `refname`
> above.
>
> +worktreepath::
> + The absolute path to the worktree in which the ref is checked
> + out, if it is checked out in any linked worktree. ' ' otherwise.
> +
Normally single-quotes are used in asciidoc to emphasize text, and the
quotes aren't passed through. Asciidoc (and asciidoctor) do seem to
render the literal quotes here, which is good. I wonder if it would be
more clear to just write it out, though, like:
...any linked worktree. Otherwise, replaced with a single space.
Also, why are we replacing it with a single space? Wouldn't the empty
string be more customary (and work with the other "if empty, then do
this" formatting options)?
> @@ -34,6 +36,8 @@ static struct ref_msg {
> "ahead %d, behind %d"
> };
>
> +static struct worktree ** worktrees;
Minor style nit: we put the "*" in a pointer declaration next to the
variable name, without intervening whitespace. Like:
static struct worktree **worktrees;
> @@ -75,6 +79,12 @@ static struct expand_data {
> struct object_info info;
> } oi, oi_deref;
>
> +struct reftoworktreeinfo_entry {
> + struct hashmap_entry ent; // must be the first member!
> + char * ref; // key into map
> + struct worktree * wt;
> +};
A few style nits:
- the "*" space thing from above (it's in other places below, too, but
I won't point out each)
- we prefer "/* */" comments, even for single-liners
- since we do all-lowercase identifiers, use more underscores to break
things up. E.g., ref_to_worktree_entry.
Here we store the refname as a separate variable, but then point to the
worktree itself to access wt->path. Why do we treat these differently?
I.e., I'd expect to see either:
1. Each entry holding a single worktree object, and using its head_ref
and path fields, like:
struct ref_to_worktree_entry {
struct hashmap_entry ent; /* must be first */
struct worktree *wt;
};
....
entry = xmalloc(sizeof(*entry));
entry->wt = wt;
hashmap_entry_init(entry, strhash(wt->head_ref));
...
strbuf_addstr(&out, result->wt->path);
2. Each entry containing just the bits it needs, like:
struct ref_to_worktree_entry {
struct hashmap_entry ent; /* must be first */
char *ref;
char *path;
};
...
/*
* We could use FLEXPTR_ALLOC_STR() here, but it doesn't actually
* support holding _two_ strings. Separate allocations probably
* aren't a huge deal here, since there are only a handful of
* worktrees.
*/
entry = xmalloc(sizeof(*entry));
entry->ref = wt->head_ref;
entry->path = wt->path;
hashmap_entry_init(entry, strhash(entry->ref));
...
strbuf_addstr(&out, result->path);
I think the first one is strictly preferable unless we're worried about
the lifetime of the "struct worktree" going away. I don't think that's
an issue, though; they are ours until we call free_worktrees().
> @@ -114,6 +124,7 @@ static struct used_atom {
> } objectname;
> struct refname_atom refname;
> char *head;
> + struct hashmap reftoworktreeinfo_map;
> } u;
> } *used_atom;
This uses one map for each %(worktree) we use. But won't they all be the
same? It would ideally be associated with the ref-filter. There's no
ref-filter context struct to hold this kind of data, just static globals
in ref-filter.c (including this used_atom struct!). That's something
we'll probably need to fix in the long run, but I think it would be
reasonable to just have:
static struct hashmap ref_to_worktree_map;
next to the declaration of used_atom_cnt, need_symref, etc. And then
those can all eventually get moved into a struct together.
> @@ -461,6 +497,7 @@ static struct {
> { "flag", SOURCE_NONE },
> { "HEAD", SOURCE_NONE, FIELD_STR, head_atom_parser },
> { "color", SOURCE_NONE, FIELD_STR, color_atom_parser },
> + { "worktreepath", SOURCE_NONE, FIELD_STR, worktree_atom_parser },
> { "align", SOURCE_NONE, FIELD_STR, align_atom_parser },
> { "end", SOURCE_NONE },
> { "if", SOURCE_NONE, FIELD_STR, if_atom_parser },
Marking as SOURCE_NONE makes sense.
> +static const char * get_worktree_info(const struct used_atom *atom, const
> struct ref_array_item *ref)
> +{
> + struct strbuf val = STRBUF_INIT;
> + struct reftoworktreeinfo_entry * entry;
> + struct reftoworktreeinfo_entry * lookup_result;
> +
> + FLEXPTR_ALLOC_STR(entry, ref, ref->refname);
> + hashmap_entry_init(entry, strhash(entry->ref));
> + lookup_result = hashmap_get(&(atom->u.reftoworktreeinfo_map), entry,
> NULL);
> + free(entry);
We shouldn't need to do an allocation just for a lookup. That's what the
extra "keydata" parameter is for in the comparison function. And I guess
this is what led you to have "char *ref" in the struct, rather than
reusing wt->head_ref (because you don't have a "struct worktree" here).
You should be able to do it like this:
struct hashmap_entry entry;
struct ref_to_worktree_entry *result;
hashmap_entry_init(entry, strhash(ref->refname));
result = hashmap_get(&ref_to_worktree_map, &entry, ref->refname));
...
and then your comparison function would look like this:
int ref_to_worktree_hashcmp(const void *data,
const void *entry,
const void *entry_or_key,
const void *keydata)
{
const struct ref_to_worktree_entry *a = entry;
const struct ref_to_worktree_entry *b = entry;
if (keydata)
return strcmp(a->wt->head_ref, keydata);
else
return strcmp(a->wt->head_ref, b->wt->head_ref);
}
If you're thinking that this API is totally confusing and hard to figure
out, I agree. It's optimized to avoid extra allocations. I wish we had a
better one for simple cases (especially string->string mappings like
this).
Speaking of comparison functions, I didn't see one in your patch. Don't
you need to pass one to hashmap_init?
> + if (lookup_result)
> + {
> + if (!strncmp(atom->name, "worktreepath", strlen(atom->name)))
> + strbuf_addstr(&val, lookup_result->wt->path);
> + }
> + else
> + strbuf_addstr(&val, " ");
What's this extra strncmp about? If we're _not_ a worktreepath atom,
we'd still do the lookup only to put nothing in the string?
I think we'd only call this function when populate_value() sees a
worktreepath atom, though:
> @@ -1537,6 +1596,10 @@ static int populate_value(struct ref_array_item *ref,
> struct strbuf *err)
>
> if (starts_with(name, "refname"))
> refname = get_refname(atom, ref);
> + else if (starts_with(name, "worktreepath")) {
> + v->s = get_worktree_info(atom, ref);
> + continue;
> + }
So it would be OK to drop the check of atom->name again inside
get_worktree_info().
> @@ -2013,7 +2076,14 @@ void ref_array_clear(struct ref_array *array)
> int i;
>
> for (i = 0; i < used_atom_cnt; i++)
> + {
> + if (!strncmp(used_atom[i].name, "worktreepath",
> strlen("worktreepath")))
> + {
> + hashmap_free(&(used_atom[i].u.reftoworktreeinfo_map),
> 1);
> + free_worktrees(worktrees);
> + }
And if we move the mapping out to a static global, then this only has to
be done once, not once per atom. In fact, I think this could double-free
"worktrees" with your current patch if you have two "%(worktree)"
placeholders, since "worktrees" already is a global.
> diff --git a/t/t6302-for-each-ref-filter.sh b/t/t6302-for-each-ref-filter.sh
> index fc067ed672..add70a4c3e 100755
> --- a/t/t6302-for-each-ref-filter.sh
> +++ b/t/t6302-for-each-ref-filter.sh
> @@ -441,4 +441,19 @@ test_expect_success '--merged is incompatible with
> --no-merged' '
> test_must_fail git for-each-ref --merged HEAD --no-merged HEAD
> '
>
> +test_expect_success '"add" a worktree' '
> + mkdir worktree_dir &&
> + git worktree add -b master_worktree worktree_dir master
> +'
> +
> +test_expect_success 'validate worktree atom' '
> + cat >expect <<-\EOF &&
> + master: checked out in a worktree
> + master_worktree: checked out in a worktree
> + side: not checked out in a worktree
> + EOF
> + git for-each-ref --format="%(refname:short):
> %(if)%(worktreepath)%(then)checked out in a worktree%(else)not checked out in
> a worktree%(end)" refs/heads/ >actual &&
> + test_cmp expect actual
> +'
It's probably worth testing that the path we get is actually sane, too.
I.e., expect something more like:
cat >expect <<-\EOF
master: $PWD
master: $PWD/worktree
side: not checked out
EOF
git for-each-ref \
--format="%(refname:short):
%(if)%(worktreepath)%(then)%(worktreepath)%(else)not checked %out%(end)
(I wish there was a way to avoid that really long line, but I don't
think there is).
-Peff