Decide on a replacement for $Id$ tags in files [#819874]

Comment	File	Size	Author
#21	git_attribute_expansion_testing.png	85.29 KB	rocketeerbkw

Comment #1

sdboyer CreditAttribution: sdboyer commented 6 June 2010 at 20:14

Title:	Write a script to recursively strip $Id$ tags out of a project	» Create a script to recursively strip $Id$ tags out of a project
Priority:	Normal	» Minor

This really isn't a big thing, though...

Comment #2

ryanaghdam CreditAttribution: ryanaghdam commented 16 June 2010 at 13:26

Assigned:	Unassigned	» ryanaghdam

Comment #3

mikey_p CreditAttribution: mikey_p commented 16 June 2010 at 20:23

[snip] they'll still be in files wherever they were before, albeit unexpanded.

I very much doubt this, I would bet most of the tags are actually expanded.

Comment #4

sdboyer CreditAttribution: sdboyer commented 2 July 2010 at 21:31

No, they'll be unexpanded. That's going to happen as part of the migration path I've engineered. It'll actually be like they never existed at all - cvs2svn will use the kill keywords option when exporting the cvs revisions, so they'll never be present at all in the git history.

Comment #5

rocketeerbkw CreditAttribution: rocketeerbkw commented 2 July 2010 at 23:54

Should all CVS keywords be accounted for? I had to google for a list of them and found this http://ximbiot.com/cvs/manual/cvs-1.11.6/cvs_12.html

Issue title specifies $Id$ but description says

remove the tags in all their various forms

Comment #6

webchick CreditAttribution: webchick commented 2 July 2010 at 23:55

Sam, is there documentation for how one runs through the migration path locally if someone* wanted to work on this, against some realistic test data?

* Not volunteering! :D

Comment #7

sdboyer CreditAttribution: sdboyer commented 3 July 2010 at 01:28

@rocketeerbkw: only the $Id$ tag is strictly necessary. That's all that we're using in a standard way with Drupal; if folks are using those other ones in their files, that's their thing. Then again, if you're gonna do $Id$, it probably wouldn't be hard to do the others, too...maybe we provide two versions of the script.

@webchick: unfortunately, there isn't any documentation on doing the full migration path on one's own. I've been meaning to write it up, but I haven't worked on the migration path itself in a little while and so have let it slide. There are some scripts on github that DamZ and I worked on which will get you most of the way there, but they will unfortunately not do this _specific_ piece correctly because, to make cvs2svn use kill keywords, you have to hack the code. I could probably include that as part of the instructions, though...

Comment #8

chx CreditAttribution: chx commented 3 July 2010 at 02:30

And this is good why?

Comment #9

xmacinfo CreditAttribution: xmacinfo commented 4 July 2010 at 15:30

Is there a way to keep $Id$ in the files with Git? Or at least transform them to comments?

Without these tags it will be next to impossible for most of our users to distinguish which file is more recent, or which version the file is, etc.

Also, when posting issues on d.o., we will not be able to say which version of the file we need help with.

This is especially true for users who work with FTP and don't use any version control.

Comment #10

sdboyer CreditAttribution: sdboyer commented 4 July 2010 at 19:11

@chx - I don't mind you being snarky as long as you're specific about it. Do you not like the idea of stripping the tags? That's fine, you don't have to apply it to all those contrib modules you maintain. Or are you objecting to not having $Id$-type metadata present in the files themselves? That's a different discussion, dovetailing with...

@xmacinfo - $Id$ will be present in the files after the migration occurs, but it will be unexpanded (e.g., it will be just literally, '$Id$', without any revision information). We could do it in a way where the tags are still expanded in the final revision, but (IMO) doing so would be pointless and confusing: pointless, because once we switch to git, that metadata will be forever frozen in whatever state it was in when imported from CVS, and confusing, because as soon as that file is changed but the metadata stays the same, the metadata is actually saying something UNTRUE about the state of that file.

If your users use the CVS revision number to identify the version of the file they're working with, then I'm sorry, but that's just the wrong way to do things. If they need to identify the version of the file but aren't using version control, then they should use the module version information provided by Drupal itself. If your users are regularly working with -dev releases and/or making modifications to files, then they should be using version control. Relying on CVS metadata tags, which will only be there if the author bothered to include the tag, is just a deficient way of doing things.

Comment #11

sdboyer CreditAttribution: sdboyer commented 4 July 2010 at 19:21

@rocketeerbkw: Oh, just realized I half-answered your question. I said "in all their various forms" because the tag can be in any kind of comment, in any file type. So we have to account for more than just // $Id$, but also /* $Id$ */, as an example.

Comment #12

chx CreditAttribution: chx commented 4 July 2010 at 20:47

So in case this was not evident: the $Id$ tag gives you a human readable way to check on the freshness of a file. It's not a machine thing.

Comment #13

xmacinfo CreditAttribution: xmacinfo commented 4 July 2010 at 21:07

@sdboyer: Agreed, in an ideal world everything should be under version control. However, there are real-life examples where having the revision number displayed inside the file would be important.

For example, the .htaccess file and the settings.php files. These two are often modified in production to override settings for various reasons. These two files do not always change from release to release.

An FTP user uploading files to their shared hosting account cannot version control the files. For most Drupal files, a simple file replacement works nicely, but for settings.php and .htaccess (if some settings are overridden), we need a way to know about the changes.

With CVS metadata, it was easy to spot a change in those two files. Without metadata, we will need to diff the files, see if there are changes, and if there are, apply them to the file.

There are a lot of support issues just for those two files and I am looking for a way to display version information inside these files.

So I guess I worry only about these two files: settings.php and .htaccess.

And I worry about the vast majority of users that download a tarball and do not use version control or cannot use version control.

How complicated would it be to display the revision number inside settings.php and .htaccess?

Comment #14

sdboyer CreditAttribution: sdboyer commented 4 July 2010 at 21:32

Title:	Create a script to recursively strip $Id$ tags out of a project	» Decide on a replacement for $Id$, then create a script to automatically update projects

So, just had a long conversation about this in IRC, wherein the points xmacinfo raises were covered, in addition to others. Mea culpa, we do need to retain some kind of version string. I think unexpanding the keywords for the git history is still a good idea (not doing so will make git merges with CVS history in them awkward), but we do still need to retain some version numbering in the files. Git has a placeholder expansion system that we can use to do more than what was possible with CVS - see #720598-13: Consider using git-archive expansion for .info files for the list. The big drawback is that you can't immediately look at SHAs and know which came first, as you could with CVS numbering. So the best we can do for that is dates.

In any case, let's repurpose this thread to a) decide on what we want to replace $Id$ with, then b) create a script, or at least figure out the regex, for doing so.

Note that a script may not end up being feasible, as we may need to rewrite ALL of the history with this new formatting system, which will mean applying the regex to every revision during the migration process itself. The reason why that might be necessary has flitted out of my mind, but I'm putting it out there anyway.

Comment #15

xmacinfo CreditAttribution: xmacinfo commented 5 July 2010 at 00:02

a) decide on what we want to replace $Id$ with

Any example of what it would look like?

Also, does this replacement need to be visible on all files or only on a subset of files?

As for specific information inside the *new* tag, if the 'date' is available and printed there, I am sure this will be enough for the use cases I have in mind.

Comment #16

rocketeerbkw CreditAttribution: rocketeerbkw commented 5 July 2010 at 00:21

@sdboyer: thanks for the clarification, it seems CVS will also expand $Id: $

A)
From some CVS docs:

This CVS keyword will expand out to the name of the RCS file (which is the name of the file plus a ,v), the revision number, the last modified date, and the username of the person to last modify the file.

Shouldn't we replace $Id$ with something that will show the same info? Perhaps even on one line as well? (although the full sha1 is long for that)

B)
So for replacing unexpanded tags, the following regex should work: \$Id[^$]*\$

we may need to rewrite ALL of the history with this new formatting system

Does that mean we need to replace the expanded version of $Id$ as well?

Comment #17

rocketeerbkw CreditAttribution: rocketeerbkw commented 5 July 2010 at 00:50

I should note that \$Id[^$]*\$ is PCRE in case the script that's eventually written doesn't support PCRE
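
For anyone who wants to try the pattern outside a PCRE engine, here is a minimal Python sketch. The loose pattern is the one quoted above; the stricter variant and the function name unexpand_id are my own assumptions, added because the loose form can also swallow PHP code such as a variable whose name starts with $Id.

```python
import re

# The pattern from this thread: matches "$Id$" and "$Id: ... $".
LOOSE_ID_TAG = re.compile(r"\$Id[^$]*\$")

# Assumed stricter variant: only a bare "$Id$" or "$Id:" followed by
# text on the same line, so "$Idx = 1; $y" is left alone.
STRICT_ID_TAG = re.compile(r"\$Id(?::[^$\n]*)?\$")


def unexpand_id(text, pattern=STRICT_ID_TAG):
    """Collapse every matched keyword back to the bare $Id$ form."""
    return pattern.sub("$Id$", text)


if __name__ == "__main__":
    line = "// $Id: foo.module,v 1.12 2010/07/04 19:11:00 sdboyer Exp $"
    print(unexpand_id(line))
```

The same substitution works regardless of the comment style around the tag, since only the keyword itself is matched.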

Comment #18

pwolanin CreditAttribution: pwolanin commented 5 July 2010 at 02:29

I thought the conclusion in IRC was to leave the expanded $Id$ tags in the imported files, and replace them in the tip of each branch so they can be expanded by git archive when we make releases?

Comment #19

sdboyer CreditAttribution: sdboyer commented 5 July 2010 at 14:31

@xmacinfo: chx had some ideas about that. I think he was happy with just %ai ultimately. But I'd like to see people make some proposals here, discuss pros/cons :)

@rocketeerbkw: yes, that's the conclusion other people helped me to come to, and what this discussion should now be about - what to replace $Id$ tags with that'll give similarly useful data once we're into git.

@pwolanin: I wasn't clear on leaving them expanded in the historical files, but it does seem like that's the preferred direction, so sure. (That also has the side benefit of making it relatively easy to set up the migration steps so that other people can test it, as they'll no longer need to hack cvs2svn). And then, yes, the script should replace the expanded tags in the tip of each branch. That does mean we'll have historical CVS metadata in there which doesn't actually refer to anything real or check-out-able, but that's probably OK.

Also, it's worth noting that I'm not quite sure how these placeholder expansions work - if they just expand the date of the last commit regardless of whether a file was modified in that commit, they'll be much more limited in their usefulness. I hope that's not what they do, but we ARE moving away from a non-atomic system. I'll experiment with that when I get a chance.

Comment #20

xmacinfo CreditAttribution: xmacinfo commented 5 July 2010 at 15:11

Let's hope the placeholder will expand correctly. I was not aware that Git offered these.

Like chx, I would be happy with %ai. I don't think any other information is relevant for end users.

Comment #21

rocketeerbkw CreditAttribution: rocketeerbkw commented 6 July 2010 at 00:41

File	Size
git_attribute_expansion_testing.png	85.29 KB

It appears they are expanded in all files based on the most recent commit. I've attached a screenshot that demonstrates this.

I used git archive -o test.zip HEAD to expand the text in file1/2/3.txt
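
A rough Python sketch of how a test like this can be set up, for anyone wanting to reproduce it. The export-subst attribute and the $Format:...$ wrapper are the documented git-archive mechanism; the file contents, the *.txt attribute pattern and the function names are assumptions, since the exact contents of the test files are only visible in the screenshot.

```python
import pathlib
import subprocess


def write_demo_files(repo):
    """Create three text files carrying a git-archive placeholder."""
    repo = pathlib.Path(repo)
    # export-subst asks git archive to expand $Format:...$ in matching files.
    (repo / ".gitattributes").write_text("*.txt export-subst\n")
    for name in ("file1.txt", "file2.txt", "file3.txt"):
        # %ai is the author date of the commit being archived.
        (repo / name).write_text("Last change: $Format:%ai$\n")
    return repo


def archive_head(repo, output="test.zip"):
    """Run the command quoted above; needs git on PATH and a committed tree."""
    subprocess.run(
        ["git", "-C", str(repo), "archive", "-o", output, "HEAD"],
        check=True,
    )
```

The files, including .gitattributes, have to be committed before archiving, because git archive reads attributes from the tree it exports. Committing a change to only one file afterwards and archiving again is what shows the same date appearing in all three.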

Comment #22

xmacinfo CreditAttribution: xmacinfo commented 6 July 2010 at 01:51

The more I think about this, the more I think we should inject, one way or another, revision info into only two files:

settings.php
.htaccess

Can a script inject some type of revision info (the date being sufficient here) inside those two files?

Comment #23

pwolanin CreditAttribution: pwolanin commented 6 July 2010 at 02:55

@xmacinfo - we are not just talking about Drupal core here, but rather every contrib module as well.

Comment #24

xmacinfo CreditAttribution: xmacinfo commented 6 July 2010 at 03:41

@pwolanin - Granted. My main concerns are for core. ;-)

Comment #25

Barry_Fisher CreditAttribution: Barry_Fisher commented 10 July 2010 at 09:59

Should it/ could it be the case that there is the option of expansion rules presented when downloading a release?

I guess this would come under the new Project module integration? It would seem that keeping these tags is a convenience for people not using version control on production servers. Perhaps I'm missing some other uses here, but I think how useful the Id tags remain will depend on the end user/developer. As a crude way of comparing the latest files, I would just look at the modification date, but that's just me. For anything more complex, a standard diff is easy enough, especially with IDEs and GUI tools as good as they are.

The situation with most shared hosts is that version control can't be used and so expansion tags could be useful for some folks. For dedicated machines less so.

My point here is that it's horses for courses and the user should have the ability to decide whether expansion tags are included in downloads.

Any thoughts on how a choice could be integrated?

An afterthought.... should Id tags be left out altogether from source files as per discussed before and then have a date/version injected into the download if the user wants it?

Comment #26

fgm CreditAttribution: fgm commented 10 July 2010 at 14:49

I discussed this issue (like probably most people coming from SCCS/RCS/CVS do) a few months ago with several people on #git and the best (IMHO) suggestion, once past their initial outrage ("one of /these/ again") that came out of it was to

- do not touch the individual files in a normal checkout/pull, à la CVS
- generate the version info in one file in the packaged releases
- yes, even -dev releases, and maybe especially them, since these are the only ones which do not carry a meaningful version information otherwise,

This provides an easy, human-readable way of identifying which version of a package is actually being used, and does not mess with the people using live checkouts and suggesting patches. These can use git themselves to examine version information anyway.

Comment #27

chx CreditAttribution: chx commented 11 July 2010 at 00:42

So here is the scenario: you have a client, unknown origins. She has some Drupal. You need to determine, and quickly, whether module X is up to date. Right now, regardless of how that file landed there, it will have a definite version number in it, and anyone capable of operating an FTP client can check it. Even over the phone. Easily. This is not something we want to lose. Therefore adding keywords on packaging is unacceptable, because what if the original developer did not use the package but git? And then uploaded via FTP, and now there is no git to check with? Next, what if a shop does not use git, their checkout won't have drupal.org versions? This sounds bad.

Comment #28

fgm CreditAttribution: fgm commented 11 July 2010 at 06:20

@chx: this is exactly how I presented the problem at that time, and the solution satisfied me, considering anything on a customer site would be either
- a production release (hence packaged and even with a clean drupal version)
- a dev version (hence packaged, with drupal date info)
- a VCS (whichever) checkout done on-site (hence containing VCS version info)

However, I had not envisioned a developer pushing to a customer something that is not even a -dev version (hence packaged), AND doing so not via a VCS checkout, but via FTP from his own checkout without the VCS info. That sounds like really bad practice, but after having audited some customer sites, I can now believe anything - including this - can happen.

So how about a checkout hook generating a version file, and that file being removed by a pre-commit hook: that way the files stay pristine for git comparisons, and there is still a human-readable version.

Comment #29

pwolanin CreditAttribution: pwolanin commented 11 July 2010 at 13:20

A thought - in that case maybe the .info file in git should be a template (.info.tpl or some such), and the checkout could create the functional .info file?

The problem here is that I'm not sure if these sorts of checkout hooks come along when you do a git clone.

Comment #30

sdboyer CreditAttribution: sdboyer commented 11 July 2010 at 17:00

@fgm: yeah, that's the basic pattern of argument I've made in the past - that the metadata is only useful if you're working with a packaged release, not something that's been directly cloned out. In the latter case, you should just use git to find out the state of items. And yes, given what rocketeerbkw demonstrated, it's quite pointless to have keyword expansion in all files if they're just going to reflect the latest snapshot, not data pertinent to that specific file.

There's no question that we'll need to have at least a single file with the latest version in it. The real question is whether or not that's going to be enough to satisfy these other, legitimate use cases. If it isn't, I can only think of two choices: we patch git to create the sort of placeholders we need, or we write something more elaborate into the packaging process that basically does its own keyword expansion. The latter might simply be prohibitively expensive in terms of sheer number of git commands it runs: for every branch being packaged, it would need to run something like git rev-list --max-count=1 --topo-order <branch name> -- <file name>. Git's fast, but unless we could figure out a way to batch that operation, I don't know if it's feasible.
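
One possible way to batch that lookup, sketched in Python and not benchmarked: walk the branch history once with git log --name-only and record the first (newest) commit seen for each path, instead of running one rev-list per file. The marker prefix and the function names are assumptions; the %H and %ai placeholders and the --name-only option are standard git log features.

```python
import subprocess

MARK = "@@"  # assumed prefix to tell commit lines apart from file names


def parse_log(text):
    """Map each path to the (sha, date) of the newest commit touching it."""
    latest = {}
    current = None
    for line in text.splitlines():
        if line.startswith(MARK):
            sha, _, date = line[len(MARK):].partition(" ")
            current = (sha, date)
        elif line and current is not None:
            # git log lists newest commits first, so keep the first hit only.
            latest.setdefault(line, current)
    return latest


def last_modified(repo, branch):
    """One git invocation per branch instead of one per file."""
    out = subprocess.run(
        ["git", "-C", repo, "log", f"--format={MARK}%H %ai",
         "--name-only", branch],
        check=True, capture_output=True, text=True,
    ).stdout
    return parse_log(out)
```

This ignores renames, merge commits (which list no files by default) and paths that git quotes because of unusual characters, so it is a starting point for a benchmark rather than a finished packaging step.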

Comment #31

webchick CreditAttribution: webchick commented 11 July 2010 at 17:52

Personally, the "person who was savvy enough to build their site with Git checkouts on localhost but only has FTP access to the server" use case is not compelling enough for me to possibly delay this entire process for weeks trying to figure out how to hack in support for keyword expansion into Git. If they were savvy enough to check out the stuff from Git in the first place, they're savvy enough to re-download the stuff from the server and use Git to check the version numbers from their local computer. I guess my only concern is whether we can have the equivalent of a "Git deploy" module without this. If so, I don't see this as something worth expending a whole lot of effort on, when there are a lot of other things to do to get the migration done.

I am, however, curious what the Git equivalent of 'svn info' is. I couldn't seem to find one that was not a shell script on someone's blog. But as long as there's a way for someone with a Git checkout to run git foo command and get the commit number the files correspond to, I'm fine with replacing $Id$ with nothing, personally.
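
There is no single built-in command for this as far as I know, but a few standard ones cover most of what svn info prints. A minimal Python sketch, assuming git is on PATH and the remote is named origin; the function names and dictionary keys are made up for illustration.

```python
import subprocess


def run_git(args, repo="."):
    """Run one git command in the given checkout and return its stdout."""
    result = subprocess.run(
        ["git", "-C", repo, *args],
        check=True, capture_output=True, text=True,
    )
    return result.stdout.strip()


def git_info(run=run_git):
    """Collect roughly what `svn info` reports for the current checkout."""
    return {
        "commit": run(["rev-parse", "HEAD"]),
        "branch": run(["rev-parse", "--abbrev-ref", "HEAD"]),
        "date": run(["log", "-1", "--format=%ai"]),
        # Fails with a non-zero exit if no remote called origin exists.
        "origin": run(["config", "--get", "remote.origin.url"]),
    }


if __name__ == "__main__":
    for key, value in git_info().items():
        print(f"{key}: {value}")
```

The run parameter is only there so the lookup can be exercised without a real repository.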

Comment #32

fgm CreditAttribution: fgm commented 12 July 2010 at 06:12

@webchick: this script provides information similar to svn info, although not in a format as easily parseable. Could be a start.

Comment #33

hunmonk CreditAttribution: hunmonk commented 12 July 2010 at 22:07

sdboyer and i were discussing this issue more today. i agree w/ webchick that we should definitely not over-complicate this issue. given that, i'd like to propose the following plan:

a. clobber all $Id$ tags everywhere, even historical commits. this completely eliminates possible merge conflict problems
b. as part of the packaging, run a script that takes the git checkout in question and builds a FILE_MANIFEST.txt (or whatever it would be called) and drops it into the root of the checkout prior to tarballing/zipping. the manifest would contain the names of all files in the checkout with the same kind of info we find useful in the current $Id$ tags (date last modified, etc)
c. rebuild all the d.o packages so even older point releases have the correct manifest

this solves the issue in a simple, straightforward manner for anybody that a) downloads packages, or b) checks out via git, which should be the vast majority of use cases.

Comment #34

sdboyer CreditAttribution: sdboyer commented 12 July 2010 at 22:18

Addendums to hunmonk's items:

a. Clobbering (that is, unexpanding) $Id$ everywhere means we'll have no problems with stale tags. Think about it - after we migrate, if we leave $Id$ tags expanded then they'll remain unchanged in files, telling people something very WRONG about the file version. And no, it's not as easy to just clobber tags at branch tips - that would require a separate script, very much like the one this thread was originally, and should maybe again be, about.
b. Including last-modified SHA1/date for each file in a package is going to be bound by the issues I described in the latter half of #819874-30: Decide on a replacement for $Id$ tags in files. It might turn out to be fine, but I'm gonna be worried about it until we actually benchmark it.

Comment #35

dww CreditAttribution: dww commented 12 July 2010 at 22:23

Since webchick raises it in #31, let me just say as the maintainer of the cvs_deploy module that it doesn't care about $Id$ tags at all. It's just directly inspecting the CVS metadata about a directory (the stuff you could find out with "cvs status") and using some hooks to tell update status/manager about that. So, FWIW, this thread has no bearing on the feasibility of a git_deploy module whatsoever.

Cheers,
-Derek

Comment #36

kbahey CreditAttribution: kbahey commented 12 July 2010 at 22:28

@hunmonk

I basically like the manifest file idea.

What I am not clear on is under such a scenario, how do we address the following use cases?

1. Someone deploys a site via CVS, but then edits the files locally. I know this is bad, but it does happen. Right now with CVS and $Id$, I have a starting point on what the original files were, and can diff them against it to see if they were hacked.

2. Someone deploys a site via a Git checkout, then edits the files locally. I don't have $Id$ anymore, so I don't know which branch they deployed from, and therefore what to diff/merge against.

Comment #37

dww CreditAttribution: dww commented 12 July 2010 at 22:31

Oh, and while I'm spreading useful info, please see #606592-4: Allow updating core with the update manager about how a manifest file could be useful for the update manager.

Comment #38

apaderno CreditAttribution: apaderno commented 12 July 2010 at 22:33

What I am not clear on is under such a scenario, how do we address the following use cases?

1. Someone deploys a site via CVS, but then edit them locally. I know this is bad, but it does happen. Right now with CVS and $Id$, I have a starting point on what the original files were, and can diff them against it to see if they were hacked.

2. Someone deploys a site via a Git checkout, then edits them locally. I don't have $Id$ anymore, so I don't know which branch they deployed from, so I can diff/merge against it.

Those cases seem similar to the case described by webchick in comment #31, for which she says we should not be too concerned with edge cases.

Comment #39

hunmonk CreditAttribution: hunmonk commented 12 July 2010 at 22:42

@kbahey: wouldn't it be best practice in this case to deploy a checkout from a git clone, and include the repo with the uploaded files? i think that would be possible, and thus allow for querying the repo if necessary, even if you have to re-download it from a shared server to do so. i guess that equals more bandwidth/disk space used, but worth the trade off i would think.

Comment #40

webchick CreditAttribution: webchick commented 12 July 2010 at 22:44

Well, also, unless I'm reading #33b wrong, said evil hacker of core would have their MANIFEST.txt file to reference to know where (branch/tag/whatever), who (committer) and when (timestamp) the files originally came from, no? Isn't that just as good as $Id$? Or possibly better, depending on what other goodies we put in MANIFEST.txt?

Comment #41

kbahey CreditAttribution: kbahey commented 12 July 2010 at 23:20

@hunmonk

I was not advocating the above use cases. I was asking what you do if you are faced with such a case.

To summarize, the use case is more like: "you took over a site from someone else who went against best practices" what is the course of action here?

If we use %awhatever in the file itself in a comment (i.e. replacing $Id$ with %aX), it would have the same effect as $Id$.

Granted, this is an edge case, and should not derail the whole process.

Comment #42

marvil07 CreditAttribution: marvil07 commented 13 July 2010 at 00:03

@webchick: about "git info": http://marvil07.soup.io/post/65026709/git-info (kind of OT, that's why it's outside)

Comment #43

sdboyer CreditAttribution: sdboyer commented 6 August 2010 at 07:02

Assigned:	ryanaghdam	» Unassigned

Unassigning.

Comment #44

webchick CreditAttribution: webchick commented 6 August 2010 at 07:11

Tagging. This is a task that might be good for a volunteer to tackle once we get a decision here.

Comment #45

webchick CreditAttribution: webchick commented 6 August 2010 at 07:36

Issue tags:	+git low hanging fruit

Oops. :P

Comment #46

sirkitree CreditAttribution: sirkitree commented 6 August 2010 at 11:27

I'd certainly prefer not to have info appended to my files. A separate manifest sounds like an awesome thing to have. We're already moving that way in our projects as it is with .make files to register what all (core, contrib, custom) is in the project so this approach totally makes sense to me for individual modules and such.

Also would love to know where some documentation is on this whole process so that I could actually try it out and work on this issue once a decision is made.

Comment #47

fgm CreditAttribution: fgm commented 6 August 2010 at 16:35

Why could this "manifest" file not be the .info file? It would mesh nicely with other uses of the info file to hold version information and source URL, it seems.

Comment #48

xmacinfo CreditAttribution: xmacinfo commented 6 August 2010 at 21:17

The .info file is parsed by Drupal for dependencies, version number, description, etc.

The MANIFEST.TXT file would not be parsed by Drupal and the content of this file would serve as information only.

Comment #49

sdboyer CreditAttribution: sdboyer commented 7 August 2010 at 03:16

@fgm - First, info files are already a morbidly obese hodgepodge. Also, info files need a fully bootstrapped drupal context to make sense - a good manifest needs nothing more than md5, cat and find to be verified. Maybe most important, if we're creating a list of hashes of every file in a package, then that list can't be put IN one of the files it needs to hash.
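
To make the hashing point concrete, here is a minimal Python sketch of a manifest builder that skips the manifest file itself. The file name MANIFEST.txt, the function name and the "hash, two spaces, path" line layout (chosen so md5sum -c could check it) are assumptions, not a decided format.

```python
import hashlib
import os

MANIFEST = "MANIFEST.txt"  # assumed name; nothing has been decided yet


def build_manifest(root):
    """Return manifest text with one 'md5  relative/path' line per file."""
    lines = []
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # make the walk order deterministic
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            rel = os.path.relpath(path, root).replace(os.sep, "/")
            if rel == MANIFEST:
                # The manifest cannot contain a hash of itself.
                continue
            with open(path, "rb") as fh:
                digest = hashlib.md5(fh.read()).hexdigest()
            lines.append(f"{digest}  {rel}")
    return "\n".join(lines) + "\n"
```

A packaging script would write the returned text to MANIFEST.txt in the package root just before creating the tarball.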

Comment #50

fgm CreditAttribution: fgm commented 7 August 2010 at 09:07

@xmacinfo: yes, that was precisely the point

@sdboyer: about obese format: true; especially seeing the format extensions Features puts there. bootstrapped drupal not needed, though. But the third argument kills the suggestion indeed, you're right.

This leaves us in a somewhat annoying situation, though: we will then have TWO metadata files in packages, the new manifest file and the info file, which will surely be derided by critics. It reminds me of the ephemeral .schema files introduced then shelved during the D6 development process.

Comment #51

sdboyer CreditAttribution: sdboyer commented 9 August 2010 at 17:08

Taking a note from gentoo, which has just about the most crazy-powerful packaging system in the universe:

$ ls -1lA /usr/portage/www-servers/apache
total 132K
139800 -rw-r--r-- 1 root root 109K Jul 11 01:37 ChangeLog
139801 -rw-r--r-- 1 root root 1.6K Jul 11 01:37 Manifest
139802 -rw-r--r-- 1 root root 2.3K Jul 11 01:37 apache-2.2.14-r1.ebuild
139805 -rw-r--r-- 1 root root 2.3K Jul 11 01:37 apache-2.2.15.ebuild
140235 -rw-r--r-- 1 root root  622 Jan 11  2010 metadata.xml

$ cat /usr/portage/www-servers/apache/Manifest
DIST gentoo-apache-2.2.14-r1-20091008.tar.bz2 62359 RMD160 0e78de9a61265be2ef797e02bce0cf89f0a5fd2a SHA1 357316581f7d7d289655992216be6c5f5342f32c SHA256 99db378884b33af1c97713f63d92f0bb1d02eef6dc1f8f47a9addd258b3f7233
DIST gentoo-apache-2.2.15-20100307.tar.bz2 63716 RMD160 aa16c46ec930c020820293b884876946b81bd476 SHA1 20fa102d6094d00d3c874b0b1df69d0ddcf34339 SHA256 b3c4ca6eed24ea82ff37bfa331403b09c94f3b2a8b5b1058761651c6824787c1
DIST httpd-2.2.14.tar.bz2 5147171 RMD160 ff5077e444ba995475202bb3b9be733384c809d1 SHA1 eacd04c87b489231ae708c84a77dc8e9ee176fd2 SHA256 b2deab8a5e797fde7a04fb4a5ebfa9c80f767d064dd19dcd2857c94838ae3ac6
DIST httpd-2.2.15.tar.bz2 4959582 RMD160 e5c5da1fdf86a6b0501f6c8e97ccb1982e81cfdf SHA1 5f0e973839ed2e38a4d03adba109ef5ce3381bc2 SHA256 5ae0c428e7abd87eecbac8564d90a7182104325bae7086c21db7b3a1e3140ca7
EBUILD apache-2.2.14-r1.ebuild 2275 RMD160 347fabe296dafc6bbed9f45d8f6b102659c58495 SHA1 bf266591b858a3d59c2b6c0d029ec05bd073b2d9 SHA256 2cd9e5df7b8302247aba75eaea51901defd5b998a911e599fb3ae3a4928d318c
EBUILD apache-2.2.15.ebuild 2326 RMD160 6d56b8771691af5ad1afb35f1b7b0c76a1c0a317 SHA1 93bad89a62c4c5f22f2eafcda0187186c7820c54 SHA256 2d4d2c0944904f405332893dd7ffe62a6ea4d9ee77a516b64a48b185da933ab0
MISC ChangeLog 110971 RMD160 f64bb52e5a80bf318995de2d0474e046ece52df4 SHA1 1cba45b7a7b577cc5e21c7530d63cffd17edf469 SHA256 96c03d427af4e776fbf949ead393c91d55c755270fab6e3730264a7180dee148
MISC metadata.xml 622 RMD160 217aba625932dbcfcb8bcee1f8a38e6f248e65a7 SHA1 1e7494fe8e49166c8b87a178aca78c9a658f92c7 SHA256 eb6d4f305a170e97bf29e5c6e9d1df6edea8fcdabd35ac2d99ebc8fe5bbd71b4

That's 1) a metadata XML file for displaying info when searching in portage, 2) a Manifest file containing hashes, 3) a ChangeLog corresponding to changes in portage, not (just) new apache releases, and 4) individual ebuild files that contain build instructions. Critics can bite me. :)
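For anyone who wants to poke at the format, here is a minimal Python sketch of how one line of the Manifest above could be read. It only assumes what is visible in the listing: a type, a file name, a size in bytes, then alternating algorithm/digest pairs. The names `ManifestEntry` and `parse_manifest_line` are made up for illustration, and file names containing spaces are not handled.

```python
from typing import NamedTuple


class ManifestEntry(NamedTuple):
    kind: str     # DIST, EBUILD or MISC in the listing above
    name: str     # file name
    size: int     # size in bytes
    hashes: dict  # algorithm name -> hex digest


def parse_manifest_line(line: str) -> ManifestEntry:
    """Split one 'KIND name size ALGO digest ALGO digest ...' line."""
    fields = line.split()
    if len(fields) < 3 or (len(fields) - 3) % 2 != 0:
        raise ValueError(f"malformed manifest line: {line!r}")
    kind, name, size = fields[0], fields[1], int(fields[2])
    # The remaining fields alternate between algorithm name and digest.
    hashes = dict(zip(fields[3::2], fields[4::2]))
    return ManifestEntry(kind, name, size, hashes)


# The metadata.xml line from the listing above.
entry = parse_manifest_line(
    "MISC metadata.xml 622 "
    "RMD160 217aba625932dbcfcb8bcee1f8a38e6f248e65a7 "
    "SHA1 1e7494fe8e49166c8b87a178aca78c9a658f92c7 "
    "SHA256 eb6d4f305a170e97bf29e5c6e9d1df6edea8fcdabd35ac2d99ebc8fe5bbd71b4"
)
print(entry.kind, entry.name, entry.size, sorted(entry.hashes))
```

The point is just that the file is trivially machine-readable, which is what matters for a packaging script.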

Comment #52

webchick commented 9 August 2010 at 19:38

Issue tags:	-git phase 2	+git phase 1

Fixing tag.

Comment #54

xmacinfo commented 9 August 2010 at 21:35

The manifest example in #51 is impressive, but for end users it's missing key information:

The date of last change. ;-)

Comment #55

sdboyer commented 10 August 2010 at 06:07

Issue tags:	-git phase 1	+git phase 2

Should be phase 2.

Comment #56

sdboyer commented 12 August 2010 at 00:41

@xmacinfo - The manifest in #51 is from gentoo, is VERY much not human-facing in their system, and is just an example. Not necessarily what we're going to go with. Including a datestamp in that big line of data wouldn't be difficult. Humans can use the datestamp, and machines can use the hashes. Everyone's happy.
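To make the datestamp idea concrete, here is a rough Python sketch of one manifest line that carries both a last-modified date and hashes. Nothing here is a decided format: the field order, the choice of SHA1 and SHA256, the use of the file's mtime as the datestamp, and the name `manifest_line` are all assumptions for illustration.

```python
import hashlib
import os
import tempfile
from datetime import datetime, timezone


def manifest_line(path: str) -> str:
    """Return 'name size datestamp SHA1 <hex> SHA256 <hex>' for one file."""
    with open(path, "rb") as handle:
        data = handle.read()
    # Datestamp for humans: last-modified time in UTC, ISO 8601 style.
    mtime = datetime.fromtimestamp(os.path.getmtime(path), tz=timezone.utc)
    stamp = mtime.strftime("%Y-%m-%dT%H:%M:%SZ")
    return " ".join([
        os.path.basename(path),
        str(len(data)),
        stamp,
        "SHA1", hashlib.sha1(data).hexdigest(),
        "SHA256", hashlib.sha256(data).hexdigest(),
    ])


# Demo on a throwaway file; os.utime pins the mtime so the line is stable.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "example.module")
    with open(demo_path, "wb") as handle:
        handle.write(b"hello\n")
    os.utime(demo_path, (0, 0))
    demo_line = manifest_line(demo_path)
print(demo_line)
```

A real packaging script would more likely take the date from the commit that last touched the file than from the filesystem, but the shape of the line would be the same.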

Also, just a quick note because I just noticed #41 - @kbahey, the whole approach of using placeholder expansion is moot because it expands based on the current commit, whether or not the file was modified in that commit. So all placeholders look identical.

I'm gonna write up a definitive recommendation on a path forward for this soon.

Comment #57

webchick commented 25 August 2010 at 12:57

Project:	Drupal.org infrastructure	» The Great Git Migration
Component:	Git	» Migration scripts
Priority:	Minor	» Normal

Comment #58

chrisstrahl commented 7 September 2010 at 19:30

Assigned:	Unassigned	» sdboyer
Issue tags:		+git sprint 1

sdboyer to publish an update on this and resolve the issue.

Comment #59

sdboyer commented 17 September 2010 at 04:30

Title:	Decide on a replacement for $Id$, then create a script to automatically update projects	» Decide on a replacement for $Id$ tags in files
Status:	Active	» Fixed

Man, I hate it when "very soon" turns into more than a month. Well, better late than never - here's how we should resolve this one. Please reopen the issue if there are glaring problems with what I've laid out.

There are a few pieces to this, so I'll break it out item by item.

We will be clobbering all historical $Id$ tags; git.drupalcode.org already does this, and has been doing it for a while. This'll prevent merge conflicts and ensure stale information doesn't make it into the new, shiny git repos and end up confusing people.
To make for a clean break, we need a script that can be run as part of the migration process to kill all remnants of $Id$ in all projects. I've opened a separate issue for that (#914280: Create a script to remove all $Id$ tags from projects), so that this issue can be just about the decision.
Because of the limitations of git's keyword expansion, we will NOT be replacing $Id$ with anything else. Drupal files will be unadorned by in-file meta-packaging info.
The packaging scripts will need to be expanded to include a MANIFEST-like file along the lines of #51. I've opened #914284: Update packaging scripts to create some sort of MANIFEST for this, though there might already be one open.
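As a rough idea of what the tag removal in the second item could look like, here is a small Python sketch. It is not the script tracked in #914280; it assumes the tags sit on their own line behind a `//`, `#`, `;`, `/*` or `*` comment marker, and the names `ID_LINE` and `strip_id_tags` plus the `foo.module` sample are hypothetical.

```python
import re

# A line holding nothing but an (optionally expanded) $Id$ tag, with an
# optional comment opener (//, #, ;, /*, *) and an optional closing */.
ID_LINE = re.compile(
    r"^[ \t]*(?://|#|;|/\*|\*)?[ \t]*"   # indentation and comment opener
    r"\$Id(?::[^$\n]*)?\$"               # $Id$ or $Id: ... $
    r"[ \t]*(?:\*/)?[ \t]*(?:\r?\n|$)",  # optional */ and the line ending
    re.MULTILINE,
)


def strip_id_tags(text: str) -> str:
    """Remove lines that contain only a CVS $Id$ keyword."""
    return ID_LINE.sub("", text)


sample = (
    "<?php\n"
    "// $Id: foo.module,v 1.2 2010/01/01 00:00:00 bar Exp $\n"
    "\n"
    "function foo() {}\n"
)
cleaned = strip_id_tags(sample)
print(cleaned)
```

Lines that mention `$Id$` alongside other content are deliberately left alone, so anything unusual would still need a manual look.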

Comment #60

1 October 2010 at 04:40

Status:	Fixed	» Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.

Comment #61

chx commented 1 October 2010 at 21:26

Note that although no one asked me, a manifest file does work for me. It works with the "hapless client over the phone" case. The manifest is like any other file in a module package, so it gets updated at the same time no matter the deployment. Good enough for me.