Git Explained
Git [6] is a distributed version control system that tracks versions of files. It is often used to control source code by programmers collaboratively developing software.
Design goals of Git include speed, data integrity, and support for distributed, non-linear workflows — thousands of parallel branches running on different computers.[7] [8] [9]
Git was created for use in the development of the Linux kernel by Linus Torvalds and others developing the kernel.[10]
As with most other distributed version control systems, and unlike most client–server systems, Git maintains a local copy of the entire repository, a.k.a. repo, with history and version-tracking abilities, independent of network access or a central server. A repo is stored on each computer in a standard directory with additional, hidden files to provide version control capabilities.[11] Git provides features to synchronize changes between repos that share history; copied (cloned) from each other. For collaboration, Git supports synchronizing with repos on remote machines. Although all repos (with the same history) are peers, developers often use a central server to host a repo to hold an integrated copy.
Git is a free and open-source software shared under the GPL-2.0-only license.
The trademark "Git" is registered by the Software Freedom Conservancy, marking its official recognition and continued evolution in the open-source community.
Today, Git is the de facto standard version control system. It is the most popular distributed version control system, with nearly 95% of developers reporting it as their primary version control system as of 2022. It is the most widely used source-code management tool among professional developers. There are offerings of Git repository services, including GitHub, SourceForge, Bitbucket and GitLab.[12]
History
Torvalds started developing Git in April 2005 after the free license for BitKeeper, the proprietary source-control management (SCM) system used for Linux kernel development since 2002, was revoked for Linux.[13] [14] The copyright holder of BitKeeper, Larry McVoy, claimed that Andrew Tridgell had created SourcePuller by reverse engineering the BitKeeper protocols.[15] The same incident also spurred the creation of Mercurial, another version-control system.
Torvalds wanted a distributed system that he could use like BitKeeper, but none of the available free systems met his needs. He cited an example of a source-control management system needing 30 seconds to apply a patch and update all associated metadata, and noted that this would not scale to the needs of Linux kernel development, where synchronizing with fellow maintainers could require 250 such actions at once. For his design criterion, he specified that patching should take no more than three seconds, and added three more goals:[7]
- Take the Concurrent Versions System (CVS) as an example of what not to do; if in doubt, make the exact opposite decision.[9]
- Support a distributed, BitKeeper-like workflow.[9]
- Include very strong safeguards against corruption, either accidental or malicious.[8]
These criteria eliminated every version-control system in use at the time, so immediately after the 2.6.12-rc2 Linux kernel development release, Torvalds set out to write his own.[9]
The development of Git began on 3 April 2005.[16] Torvalds announced the project on 6 April and became self-hosting the next day.[16] [17] The first merge of multiple branches took place on 18 April.[18] Torvalds achieved his performance goals; on 29 April, the nascent Git was benchmarked recording patches to the Linux kernel tree at a rate of 6.7 patches per second.[19] On 16 June, Git managed the kernel 2.6.12 release.[20]
Torvalds turned over maintenance on 26 July 2005 to Junio Hamano, a major contributor to the project.[21] Hamano was responsible for the 1.0 release on 21 December 2005.[22]
Naming
Torvalds sarcastically quipped about the name git (which means "unpleasant person" in British English slang): "I'm an egotistical bastard, and I name all my projects after myself. First 'Linux', now 'git'."[23] [24] The man page describes Git as "the stupid content tracker".[25]
The read-me file of the source code elaborates further:[26]
The source code for Git refers to the program as "the information manager from hell".
Characteristics
Design
Git's design is a synthesis of Torvalds's experience with Linux in maintaining a large distributed development project, along with his intimate knowledge of file-system performance gained from the same project and the urgent need to produce a working system in short order. These influences led to the following implementation choices:[10]
- Strong support for non-linear development: Git supports rapid branching and merging, and includes specific tools for visualizing and navigating a non-linear development history. In Git, a core assumption is that a change will be merged more often than it is written, as it is passed around to various reviewers. In Git, branches are very lightweight: a branch is only a reference to one commit.
Distributed development: Like Darcs, BitKeeper, Mercurial, Bazaar, and Monotone, Git gives each developer a local copy of the full development history, and changes are copied from one such repository to another. These changes are imported as added development branches and can be merged in the same way as a locally developed branch.[27]
Compatibility with existing systems and protocols: Repositories can be published via Hypertext Transfer Protocol Secure (HTTPS), Hypertext Transfer Protocol (HTTP), File Transfer Protocol (FTP), or a Git protocol over either a plain socket or Secure Shell (ssh). Git also has a CVS server emulation, which enables the use of existing CVS clients and IDE plugins to access Git repositories. Subversion repositories can be used directly with git-svn.[28]
Efficient handling of large projects: Torvalds has described Git as being very fast and scalable,[29] and performance tests done by Mozilla[30] showed that it was an order of magnitude faster diffing large repositories than Mercurial and GNU Bazaar; fetching version history from a locally stored repository can be one hundred times faster than fetching it from the remote server.[31]
Cryptographic authentication of history: The Git history is stored in such a way that the ID of a particular version (a commit in Git terms) depends upon the complete development history leading up to that commit. Once it is published, it is not possible to change the old versions without it being noticed. The structure is similar to a Merkle tree, but with added data at the nodes and leaves.[32] (Mercurial and Monotone also have this property.)
Toolkit-based design: Git was designed as a set of programs written in C and several shell scripts that provide wrappers around those programs.[33] Although most of those scripts have since been rewritten in C for speed and portability, the design remains, and it is easy to chain the components together.[34]
Pluggable merge strategies: As part of its toolkit design, Git has a well-defined model of an incomplete merge, and it has multiple algorithms for completing it, culminating in telling the user that it is unable to complete the merge automatically and that manual editing is needed.[35]
Garbage accumulates until collected: Aborting operations or backing out changes will leave useless dangling objects in the database. These are generally a small fraction of the continuously growing history of wanted objects. Git will automatically perform garbage collection when enough loose objects have been created in the repository. Garbage collection can be called explicitly using git gc
.[36]
Periodic explicit object packing: Git stores each newly created object as a separate file. Although individually compressed, this takes up a great deal of space and is inefficient. This is solved by the use of packs that store a large number of objects delta-compressed among themselves in one file (or network byte stream) called a packfile. Packs are compressed using the heuristic that files with the same name are probably similar, without depending on this for correctness. A corresponding index file is created for each packfile, telling the offset of each object in the packfile. Newly created objects (with newly added history) are still stored as single objects, and periodic repacking is needed to maintain space efficiency. The process of packing the repository can be very computationally costly. By allowing objects to exist in the repository in a loose but quickly generated format, Git allows the costly pack operation to be deferred until later, when time matters less, e.g., the end of a workday. Git does periodic repacking automatically, but manual repacking is also possible with the git gc
command. For data integrity, both the packfile and its index have an SHA-1 checksum inside, and the file name of the packfile also contains an SHA-1 checksum. To check the integrity of a repository, run the git fsck
command.[37]
Another property of Git is that it snapshots directory trees of files. The earliest systems for tracking versions of source code, Source Code Control System (SCCS) and Revision Control System (RCS), worked on individual files and emphasized the space savings to be gained from interleaved deltas (SCCS) or delta encoding (RCS) the (mostly similar) versions. Later revision-control systems maintained this notion of a file having an identity across multiple revisions of a project. However, Torvalds rejected this concept.[38] Consequently, Git does not explicitly record file revision relationships at any level below the source-code tree.
Downsides
These implicit revision relationships have some significant consequences:
- It is slightly more costly to examine the change history of one file than the whole project.[39] To obtain a history of changes affecting a given file, Git must walk the global history and then determine whether each change modified that file. This method of examining history does, however, let Git produce with equal efficiency a single history showing the changes to an arbitrary set of files. For example, a subdirectory of the source tree plus an associated global header file is a very common case.
- Renames are handled implicitly rather than explicitly. A common complaint with CVS is that it uses the name of a file to identify its revision history, so moving or renaming a file is not possible without either interrupting its history or renaming the history and thereby making the history inaccurate. Most post-CVS revision-control systems solve this by giving a file a unique long-lived name (analogous to an inode number) that survives renaming. Git does not record such an identifier, and this is claimed as an advantage.[40] [41] Source code files are sometimes split or merged, or simply renamed,[42] and recording this as a simple rename would freeze an inaccurate description of what happened in the (immutable) history. Git addresses the issue by detecting renames while browsing the history of snapshots rather than recording it when making the snapshot.[43] (Briefly, given a file in revision N, a file of the same name in revision N − 1 is its default ancestor. However, when there is no like-named file in revision N − 1, Git searches for a file that existed only in revision N − 1 and is very similar to the new file.) However, it does require more CPU-intensive work every time the history is reviewed, and several options to adjust the heuristics are available. This mechanism does not always work; sometimes a file that is renamed with changes in the same commit is read as a deletion of the old file and the creation of a new file. Developers can work around this limitation by committing the rename and the changes separately.
Merging strategies
Git implements several merging strategies; a non-default strategy can be selected at merge time:[44]
- resolve: the traditional three-way merge algorithm.
- recursive: This is the default when pulling or merging one branch, and is a variant of the three-way merge algorithm.
- octopus: This is the default when merging more than two heads.
Data structures
Git's primitives are not inherently a source-code management system. Torvalds explains:[45]
From this initial design approach, Git has developed the full set of features expected of a traditional SCM,[46] with features mostly being created as needed, then refined and extended over time.
Git has two data structures: a mutable index (also called stage or cache) that caches information about the working directory and the next revision to be committed; and an object database that stores immutable objects.
The index serves as a connection point between the object database and the working tree.
The object store contains five types of objects:[47] [37]
- A blob is the content of a file. Blobs have no proper file name, time stamps, or other metadata (a blob's name internally is a hash of its content). In Git, each blob is a version of a file, in which is the file's data.
- A tree object is the equivalent of a directory. It contains a list of file names, each with some type bits and a reference to a blob or tree object that is that file, symbolic link, or directory's contents. These objects are a snapshot of the source tree. (In whole, this comprises a Merkle tree, meaning that only a single hash for the root tree is sufficient and actually used in commits to precisely pinpoint to the exact state of whole tree structures of any number of sub-directories and files.)
- A commit object links tree objects together into history. It contains the name of a tree object (of the top-level source directory), a timestamp, a log message, and the names of zero or more parent commit objects.
- A tag object is a container that contains a reference to another object and can hold added meta-data related to another object. Most commonly, it is used to store a digital signature of a commit object corresponding to a particular release of the data being tracked by Git.
- A packfile object collects various other objects into a zlib-compressed bundle for compactness and ease of transport over network protocols.
Each object is identified by a SHA-1 hash of its contents. Git computes the hash and uses this value for the object's name. The object is put into a directory matching the first two characters of its hash. The rest of the hash is used as the file name for that object.
Git stores each revision of a file as a unique blob. The relationships between the blobs can be found through examining the tree and commit objects. Newly added objects are stored in their entirety using zlib compression. This can consume a large amount of disk space quickly, so objects can be combined into packs, which use delta compression to save space, storing blobs as their changes relative to other blobs.
Additionally, Git stores labels called refs (short for references) to indicate the locations of various commits. They are stored in the reference database and are respectively:[48]
- Heads (branches): Named references that are advanced automatically to the new commit when a commit is made on top of them.
- HEAD: A reserved head that will be compared against the working tree to create a commit.
- Tags: Like branch references, but fixed to a particular commit. Used to label important points in history.
Commands
Frequently used commands for Git's command-line interface include:[49] [50]
git init
, which is used to create a git repository.
git clone [URL]
, which clones, or duplicates, a git repository from an external URL.
git add [file]
, which adds a file to git's working directory (files about to be committed).
git commit -m [commit message]
, which commits the files from the current working directory (so they are now part of the repository's history).
A .gitignore file may be created in a Git repository as a plain text file. The files listed in the .gitignore file will not be tracked by Git.[51] This feature can be used to ignore files with keys or passwords, various extraneous files, and large files (which GitHub will refuse to upload).[52]
Git references
Every object in the Git database that is not referred to may be cleaned up by using a garbage collection command or automatically. An object may be referenced by another object or an explicit reference. Git has different types of references. The commands to create, move, and delete references vary. git show-ref
lists all references. Some types are:
- heads: refers to an object locally,
- remotes: refers to an object which exists in a remote repository,
- stash: refers to an object not yet committed,
- meta: e.g., a configuration in a bare repository, user rights; the refs/meta/config namespace was introduced retrospectively, gets used by Gerrit,[53]
- tags: see above.
Implementations
Git (the main implementation in C) is primarily developed on Linux, although it also supports most major operating systems, including the BSDs (DragonFly BSD, FreeBSD, NetBSD, and OpenBSD), Solaris, macOS, and Windows.[54] [55]
The first Windows port of Git was primarily a Linux-emulation framework that hosts the Linux version. Installing Git under Windows creates a similarly named Program Files directory containing the Mingw-w64 port of the GNU Compiler Collection, Perl 5, MSYS2 (itself a fork of Cygwin, a Unix-like emulation environment for Windows) and various other Windows ports or emulations of Linux utilities and libraries. Currently, native Windows builds of Git are distributed as 32- and 64-bit installers.[56] The git official website currently maintains a build of Git for Windows, still using the MSYS2 environment.[57]
The JGit implementation of Git is a pure Java software library, designed to be embedded in any Java application. JGit is used in the Gerrit code-review tool, and in EGit, a Git client for the Eclipse IDE.[58]
Go-git is an open-source implementation of Git written in pure Go.[59] It is currently used for backing projects as a SQL interface for Git code repositories and providing encryption for Git.[60]
Dulwich is an implementation of Git written in pure Python with support for CPython 3.6 and later and Pypy.[61]
The libgit2 implementation of Git is an ANSI C software library with no other dependencies, which can be built on multiple platforms, including Windows, Linux, macOS, and BSD.[62] It has bindings for many programming languages, including Ruby, Python, and Haskell.[63] [64] [65]
JS-Git is a JavaScript implementation of a subset of Git.[66]
GameOfTrees is an open-source implementation of Git for the OpenBSD project.[67]
Git server
As Git is a distributed version control system, it could be used as a server out of the box. It is shipped with a built-in command git daemon
which starts a simple TCP server running on the Git protocol.[68] Dedicated Git HTTP servers help (amongst other features) by adding access control, displaying the contents of a Git repository via the web interfaces, and managing multiple repositories. Already existing Git repositories can be cloned and shared to be used by others as a centralized repo. It can also be accessed via remote shell just by having the Git software installed and allowing a user to log in.[69] Git servers typically listen on TCP port 9418.[70]
Open source
- Hosting the Git server using the Git Binary.[71]
- Gerrit, a Git server configurable to support code reviews and provide access via ssh, an integrated Apache MINA or OpenSSH, or an integrated Jetty web server. Gerrit provides integration for LDAP, Active Directory, OpenID, OAuth, Kerberos/GSSAPI, X509 https client certificates. With Gerrit 3.0 all configurations will be stored as Git repositories, and no database is required to run. Gerrit has a pull-request feature implemented in its core but lacks a GUI for it.
- Phabricator, a spin-off from Facebook. As Facebook primarily uses Mercurial, Git support is not as prominent.[72]
- RhodeCode Community Edition (CE), supporting Git, Mercurial and Subversion with an AGPLv3 license.
- Kallithea, supporting both Git and Mercurial, developed in Python with GPL license.
- External projects like gitolite,[73] which provide scripts on top of Git software to provide fine-grained access control.
- There are several other FLOSS solutions for self-hosting, including Gogs,[74] Gitea, a fork of Gogs, as well as Forgejo, which is, in turn, a fork of Gitea. Gogs, as well as the two aforementioned derivatives of it, is developed using the Go language. All three solutions are made available under the MIT license.
Git server as a service
See also: Comparison of source-code-hosting facilities.
There are many offerings of Git repositories as a service. The most popular are GitHub, SourceForge, Bitbucket and GitLab.[75] [76] [77] [78] [79]
Graphical interfaces
Git, a powerful version control system, can be daunting with its command-line interface. Git GUI clients offer a graphical user interface (GUI) to simplify interaction with Git repositories.
These GUIs provide visual representations of your project's history, including branches, commits, and file changes. They also streamline actions like staging changes, creating commits, and managing branches. Visual diff tools help resolve merge conflicts arising from concurrent development.
Git comes with a Tcl/Tk GUI, which allows users to perform actions such as creating and amending commits, creating and merging branches, and interacting with remote repositories. [80]
In addition to the official GUI, many 3rd party interfaces exist that provide similar features to the official GUI distributed with Git, such as GitHub Desktop, SourceTree, and TortoiseGit.[81]
GUI clients make Git easier to learn and use, improving workflow efficiency and reducing errors. Popular options include cross-platform GitKraken Desktop (freemium) and Sourcetree (free/paid), or platform-specific choices like GitHub Desktop (free) for Windows/macOS and TortoiseGit (free) for Windows.
List of GUI Clients
While Git provides built-in GUI tools (git-gui, gitk), a wider range of third-party options cater to platform-specific user preferences.
Windows GUIs (GNU GPL/MIT and Free)
Mac GUIs (GNU GPL/MIT and Free)
Linux GUIs (GNU GPL/MIT and Free)
Proprietary GIT GUI
Adoption
The Eclipse Foundation reported in its annual community survey that as of May 2014, Git is now the most widely used source-code management tool, with 42.9% of professional software developers reporting that they use Git as their primary source-control system[82] compared with 36.3% in 2013, 32% in 2012; or for Git responses excluding use of GitHub: 33.3% in 2014, 30.3% in 2013, 27.6% in 2012 and 12.8% in 2011.[83] Open-source directory Black Duck Open Hub reports a similar uptake among open-source projects.[84]
Stack Overflow has included version control in their annual developer survey[85] in 2015 (16,694 responses),[86] 2017 (30,730 responses),[87] 2018 (74,298 responses)[88] and 2022 (71,379 responses).[89] Git was the overwhelming favorite of responding developers in these surveys, reporting as high as 93.9% in 2022.
Version control systems used by responding developers:
Name | 2015 | 2017 | 2018 | 2022 |
---|
Git | 69.3% | 69.2% | 87.2% | 93.9% |
| 36.9% | 9.1% | 16.1% | 5.2% |
| 12.2% | 7.3% | 10.9% | |
| 7.9% | 1.9% | 3.6% | 1.1% |
| 4.2% | | | |
| 3.3% | | | |
| | 0.6% | | |
| | 0.4% | | |
Zip file backups | | 2.0% | 7.9% | |
Raw network sharing | | 1.7% | 7.9% | |
Other | 5.8% | 3.0% | | |
None | 9.3% | 4.8% | 4.8% | 4.3% | |
The UK IT jobs website itjobswatch.co.uk reports that as of late September 2016, 29.27% of UK permanent software development job openings have cited Git,[90] ahead of 12.17% for Microsoft Team Foundation Server,[91] 10.60% for Subversion,[92] 1.30% for Mercurial,[93] and 0.48% for Visual SourceSafe.[94] Extensions
There are many Git extensions, like Git LFS, which started as an extension to Git in the GitHub community and is now widely used by other repositories. Extensions are usually independently developed and maintained by different people, but at some point in the future, a widely used extension can be merged with Git.
Other open-source Git extensions include:
- git-annex, a distributed file synchronization system based on Git
- git-flow, a set of Git extensions to provide high-level repository operations for Vincent Driessen's branching model
- git-machete, a repository organizer & tool for automating rebase/merge/pull/push operations
Microsoft developed the Virtual File System for Git (VFS for Git; formerly Git Virtual File System or GVFS) extension to handle the size of the Windows source-code tree as part of their 2017 migration from Perforce. VFS for Git allows cloned repositories to use placeholders whose contents are downloaded only once a file is accessed.[95]
Conventions
Git can be used in a variety of different ways, but some conventions are commonly adopted.
- The command to create a local repo, git init, creates a branch named master.[96] Often it is used as the integration branch for merging changes into.[97] Since the default upstream remote is named origin, the default remote branch is origin/master. Some tools such as GitHub and GitLab create a default branch named main instead. Also, users can add and delete branches and choose any branch for integrating.
- Pushed commits generally are not overwritten, but are reverted[98] by committing another change which reverses an earlier commit. This prevents shared commits from being invalid because the commit on which they are based does not exist in the remote. If the commits contain sensitive information, they should be removed, which involves a more complex procedure to rewrite history.
- The git-flow[99] workflow and naming conventions are often adopted to distinguish feature-specific unstable histories (feature/*), unstable shared histories (develop), production-ready histories (main), and emergency patches to released products (hotfix).
- A pull request, a.k.a. merge request, is a request by a user to merge a branch into another branch.[100] Git does not itself provide for pull requests, but it is a common feature of git cloud services. The underlying function of a pull request is no different than that of an administrator of a repository pulling changes from another remote (the repository that is the source of the pull request). However, the pull request itself is a ticket managed by the hosting server which perform these actions; it is not a feature of git SCM.
Security
Git does not provide access-control mechanisms, but was designed for operation with other tools that specialize in access control.[101]
On 17 December 2014, an exploit was found affecting the Windows and macOS versions of the Git client. An attacker could perform arbitrary code execution on a target computer with Git installed by creating a malicious Git tree (directory) named .git (a directory in Git repositories that stores all the data of the repository) in a different case (such as .GIT or .Git, needed because Git does not allow the all-lowercase version of .git to be created manually) with malicious files in the .git/hooks subdirectory (a folder with executable files that Git runs) on a repository that the attacker made or on a repository that the attacker can modify. If a Windows or Mac user pulls (downloads) a version of the repository with the malicious directory, then switches to that directory, the .git directory will be overwritten (due to the case-insensitive trait of the Windows and Mac filesystems) and the malicious executable files in .git/hooks may be run, which results in the attacker's commands being executed. An attacker could also modify the .git/config configuration file, which allows the attacker to create malicious Git aliases (aliases for Git commands or external commands) or modify extant aliases to execute malicious commands when run. The vulnerability was patched in version 2.2.1 of Git, released on 17 December 2014, and announced the next day.[102] [103]
Git version 2.6.1, released on 29 September 2015, contained a patch for a security vulnerability [104] that allowed arbitrary code execution.[105] The vulnerability was exploitable if an attacker could convince a victim to clone a specific URL, as the arbitrary commands were embedded in the URL itself.[106] An attacker could use the exploit via a man-in-the-middle attack if the connection was unencrypted,[106] as they could redirect the user to a URL of their choice. Recursive clones were also vulnerable since they allowed the controller of a repository to specify arbitrary URLs via the gitmodules file.[106]
Git uses SHA-1 hashes internally. Linus Torvalds has responded that the hash was mostly to guard against accidental corruption, and the security a cryptographically secure hash gives was just an accidental side effect, with the main security being signing elsewhere.[107] [108] Since a demonstration of the SHAttered attack against git in 2017, git was modified to use a SHA-1 variant resistant to this attack. A plan for hash function transition is being written since February 2020.[109]
Trademark
"Git" is a registered word trademark of Software Freedom Conservancy under US500000085961336 since 2015-02-03.
See also
Notes and References
- Web site: Initial revision of "git", the information manager from hell . 8 April 2005 . . 20 December 2015 . live . https://web.archive.org/web/20151116175401/https://github.com/git/git/commit/e83c5163316f89bfbde7d9ab23ca2e25604af290 . 16 November 2015.
- Web site: Commit Graph . 8 June 2016 . . 19 December 2015 . live . https://web.archive.org/web/20160120121816/https://github.com/git/git/graphs/contributors . 20 January 2016.
- Web site: Git website . 9 June 2022 . https://web.archive.org/web/20220609042334/https://git-scm.com/about/small-and-fast . 9 June 2022 . live.
- Web site: Git Source Code Mirror . . 9 June 2022 . https://web.archive.org/web/20220603081319/https://github.com/git/git . 3 June 2022 . live.
- Web site: Git's GPL license at github.com . 18 January 2010 . . 12 October 2014 . live . https://web.archive.org/web/20160411135124/https://github.com/git/git/blob/master/COPYING . 11 April 2016.
- Web site: Tech Talk: Linus Torvalds on git (at 00:01:30) . YouTube . 2014-07-20 . live . https://web.archive.org/web/20151220133030/https://www.youtube.com/watch?v=4XpnKHJAok8&t=1m30s . 20 December 2015.
- linux-kernel . Torvalds . Linus . Re: Kernel SCM saga... . 2005-04-07 . 3 February 2017 . 1 July 2019 . https://web.archive.org/web/20190701210808/https://marc.info/?l=linux-kernel . live . "So I'm writing some scripts to try to track things a whole lot faster."
- Torvalds . Linus . Linus Torvalds . git . 2007-06-10 . Re: fatal: serious inflate inconsistency .
- Linus Torvalds . 2007-05-03 . Google tech talk: Linus Torvalds on git . 02:30 . 2007-05-16 . live . https://web.archive.org/web/20070528041814/http://www.youtube.com/watch?v=4XpnKHJAok8 . 28 May 2007.
- Book: https://git-scm.com/book/en/v2/Getting-Started-A-Short-History-of-Git . A Short History of Git . Pro Git . Apress . 26 December 2015 . 2014 . 2nd . live . https://web.archive.org/web/20151225223054/http://git-scm.com/book/en/v2 . 25 December 2015.
- Book: Chacon . Scott . Pro Git . 24 December 2014 . . New York, NY . 978-1-4842-0077-3 . 29–30 . 2nd . live . https://web.archive.org/web/20151225223054/http://git-scm.com/book/en/v2 . 25 December 2015.
- Web site: Krill . Paul . 28 September 2016 . Enterprise repo wars: GitHub vs. GitLab vs. Bitbucket . 2 February 2020 . InfoWorld.
- News: Brown . Zack . 2018-07-27 . A Git Origin Story . Linux Journal . Linux Journal . live . 2020-05-28 . https://web.archive.org/web/20200413113107/https://www.linuxjournal.com/content/git-origin-story . 2020-04-13.
- Web site: 2005-04-11 . BitKeeper and Linux: The end of the road? . 2023-05-18 . Linux.com . en-US.
- News: McAllister . Neil . Linus Torvalds' BitKeeper blunder . InfoWorld . 2005-05-02 . 2015-09-08 . live . https://web.archive.org/web/20150826064920/http://www.infoworld.com/article/2670360/operating-systems/linus-torvalds--bitkeeper-blunder.html . 26 August 2015.
- Re: Trivia: When did git self-host? . 2007-02-27 . Torvalds . Linus . git.
- Kernel SCM saga.. . 2005-04-06 . Torvalds . Linus . linux-kernel.
- First ever real kernel git merge! . 2005-04-17 . Torvalds . Linus . git.
- Mercurial 0.4b vs git patchbomb benchmark . 2005-04-29 . Mackall . Matt . git.
- Linux 2.6.12 . 2005-06-17 . Torvalds . Linus . git-commits-head.
- Meet the new maintainer.. . 2005-07-27 . Torvalds . Linus . git.
- Announce: Git 1.0.0 . 2005-12-21 . Hamano . Junio C. . git.
- Web site: GitFaq: Why the 'Git' name? . Git.or.cz . 2012-07-14 . live . https://web.archive.org/web/20120723224559/https://git.wiki.kernel.org/index.php/GitFaq#Why_the_.27Git.27_name.3F . 23 July 2012.
- Web site: After controversy, Torvalds begins work on 'git' . 2012-07-14 . PC World . Torvalds seemed aware that his decision to drop BitKeeper would also be controversial. When asked why he called the new software, 'git', British slang meaning 'a rotten person', he said. 'I'm an egotistical bastard, so I name all my projects after myself. First Linux, now git.' . live . https://web.archive.org/web/20110201184934/http://www.pcworld.idg.com.au/article/129776/after_controversy_torvalds_begins_work_git_/ . 1 February 2011.
- Web site: git(1) Manual Page . 2012-07-21 . live . https://web.archive.org/web/20120621133627/http://www.git-scm.com/docs/git.html . 21 June 2012.
- Web site: Initial revision of 'git', the information manager from hell · git/git@e83c516 . GitHub . 2016-01-21 . live . https://web.archive.org/web/20171008211145/https://github.com/git/git/blob/e83c5163316f89bfbde7d9ab23ca2e25604af290/README . 8 October 2017.
- Web site: Git – Distributed Workflows . Git . 15 June 2020 . 22 October 2014 . https://web.archive.org/web/20141022020026/http://git-scm.com/book/en/Distributed-Git-Distributed-Workflows . live .
- Web site: Gunjal. Siddhesh. 2019-07-19. What is Version Control Tool? Explore Git and GitHub. 2020-10-25. Medium. en.
- Torvalds . Linus . Re: VCS comparison table . 2006-10-19 . git.
- Jst's Blog on Mozillazine Web site: bzr/hg/git performance . 12 February 2015 . dead . https://web.archive.org/web/20100529094107/http://weblogs.mozillazine.org/jst/archives/2006/11/vcs_performance.html . 29 May 2010.
- Web site: Dreier . Roland . Oh what a relief it is . 2006-11-13 . live . https://web.archive.org/web/20090116175841/http://digitalvampire.org/blog/index.php/2006/11/16/oh-what-a-relief-it-is/ . 16 January 2009., observing that "git log" is 100x faster than "svn log" because the latter must contact a remote server.
- Web site: Trust . Git Concepts . Git User's Manual . 2006-10-18 . live . https://web.archive.org/web/20170222053056/https://www.kernel.org/pub/software/scm/git/docs/user-manual.html#trust . 22 February 2017.
- git . Torvalds . Linus . Re: VCS comparison table . 2009-04-10., describing Git's script-oriented design
- Web site: iabervon . Git rocks! . 2005-12-22 . live . https://web.archive.org/web/20160914100946/https://lwn.net/Articles/165202/ . 14 September 2016., praising Git's scriptability.
- Web site: Git – Git SCM Wiki. 2020-10-25. git.wiki.kernel.org.
- Web site: Git User's Manual . 2020-03-10 . live . https://web.archive.org/web/20200510190720/https://mirrors.edge.kernel.org/pub/software/scm/git/docs/git-gc.html . 10 May 2020.
- Web site: Git – Packfiles . Git.
- linux-kernel . Torvalds . Linus . Re: more git updates.. . 2005-04-10.
- git . Haible . Bruno . how to speed up 'git log'? . 2007-02-11.
- git . Torvalds . Linus . Re: impure renames / history tracking . 2006-03-01.
- git . Hamano . Junio C. . Re: Errors GITtifying GCC and Binutils . 2006-03-24.
- git . Hamano . Junio C. . Re: Errors GITtifying GCC and Binutils . 2006-03-23.
- git . Torvalds . Linus . Re: git and bzr . 2006-11-28., on using
git-blame
to show code moved between source files.
- Web site: Torvalds . Linus . git-merge(1) . 2007-07-18 . live . https://web.archive.org/web/20160716100147/https://www.kernel.org/pub/software/scm/git/docs/git-merge.html . 16 July 2016.
- linux-kernel . Torvalds . Linus . Re: more git updates... . 2005-04-10.
- git . Torvalds . Linus . Re: Errors GITtifying GCC and Binutils . 2006-03-23 . 3 February 2017 . 22 March 2021 . https://web.archive.org/web/20210322043017/https://marc.info/?l=git&m=114314642000462 . live .
- Web site: Git – Git Objects . Git.
- Web site: Git – Git References . Git.
- Web site: Git Cheat Sheet . 10 June 2024 . education.github.com.
- Web site: Git Tutorial . 10 June 2024 . web.stanford.edu.
- Web site: Git Quick Intro . 10 June 2024 . data-skills.github.io.
- Web site: Ba Tran . Andrew . Best practices for uploading to GitHub . 10 June 2024 . journalismcourses.org.
- Web site: Project Configuration File Format . Gerrit Code Review . 2 February 2020 . 3 December 2020 . https://web.archive.org/web/20201203033602/https://gerrit-review.googlesource.com/Documentation/config-project-config.html . live .
- Web site: downloads . 14 May 2012 . live . https://web.archive.org/web/20120508021315/http://git-scm.com/downloads . 8 May 2012.
- Web site: 30 November 2021. git package versions – Repology. dead. 19 January 2022. https://web.archive.org/web/20220119103832/https://repology.org/project/git/versions.
- Web site: msysGit . 20 September 2016 . . live . https://web.archive.org/web/20161010143600/https://github.com/msysgit/msysgit . 10 October 2016.
- Web site: Git – Downloading Package . Git. (source code)
- Web site: JGit . 24 August 2012 . live . https://web.archive.org/web/20120831201256/http://eclipse.org/jgit/ . 31 August 2012.
- Web site: Git – go-git. Git. 2019-04-19.
- Web site: Keybase launches encrypted git. keybase.io. 2019-04-19.
- Web site: Dulwich GitHub Repository README.md . 29 April 2024 . . live . https://web.archive.org/web/20240429015034/https://github.com/jelmer/dulwich#supported-versions-of-python . 29 April 2024.
- Web site: libgit2 . 24 August 2012 . . live . https://web.archive.org/web/20160411135623/https://github.com/libgit2/libgit2/blob/master/README.md . 11 April 2016.
- Web site: rugged . 24 August 2012 . . live . https://web.archive.org/web/20130724042431/https://github.com/libgit2/rugged . 24 July 2013.
- Web site: pygit2 . 24 August 2012 . . live . https://web.archive.org/web/20150805001221/https://github.com/libgit2/pygit2 . 5 August 2015.
- Web site: hlibgit2 . 30 April 2013 . live . https://web.archive.org/web/20130525064750/http://hackage.haskell.org/package/hlibgit2 . 25 May 2013.
- Web site: js-git: a JavaScript implementation of Git . . 2013-08-13 . live . https://web.archive.org/web/20130807173550/https://github.com/creationix/js-git . 7 August 2013.
- Web site: Game of Trees . 2024-03-10 . gameoftrees.org.
- Web site: Git – Git Daemon. Git. 2019-07-10.
- https://git-scm.com/book/en/Git-on-the-Server-Setting-Up-the-Server 4.4 Git on the Server – Setting Up the Server
- Web site: 1.4 Getting Started – Installing Git . Git . 2013-11-01 . live . https://web.archive.org/web/20131102192025/http://git-scm.com/book/ch4-1.html#The-Git-Protocol . 2 November 2013.
- Book: https://git-scm.com/book/en/v2/Git-on-the-Server-Setting-Up-the-Server. Git on the Server – Setting Up the Server. Scott. Chacon. Ben. Straub. Pro Git. Apress. 2014. 978-1484200773. 2nd.
- https://secure.phabricator.com/book/phabricator/article/diffusion_hosting/ Diffusion User Guide: Repository Hosting
- Web site: Gitolite: Hosting Git Repositories.
- Web site: Gogs: A painless self-hosted Git service.
- Web site: 2020-03-22. Highlights from Git 2.26. 2020-11-25. The GitHub Blog. en-US. You may remember when Git introduced a new version of its network fetch protocol way back in 2018. That protocol is now used by default in 2.26, so let’s refresh ourselves on what that means. The biggest problem with the old protocol is that the server would immediately list all of the branches, tags, and other references in the repository before the client had a chance to send anything. For some repositories, this could mean sending megabytes of extra data, when the client really only wanted to know about the master branch. The new protocol starts with the client request and provides a way for the client to tell the server which references it’s interested in. Fetching a single branch will only ask about that branch, while most clones will only ask about branches and tags. This might seem like everything, but server repositories may store other references (such as the head of every pull request opened in the repository since its creation). Now, fetches from large repositories improve in speed, especially when the fetch itself is small, which makes the cost of the initial reference advertisement more expensive relatively speaking. And the best part is that you won’t need to do anything! Due to some clever design, any client that speaks the new protocol can work seamlessly with both old and new servers, falling back to the original protocol if the server doesn’t support it. The only reason for the delay between introducing the protocol and making it the default was to let early adopters discover any bugs.. 22 March 2021. https://web.archive.org/web/20210322043004/https://github.blog/2020-03-22-highlights-from-git-2-26/. live.
- Web site: github.com Competitive Analysis, Marketing Mix and Traffic . Alexa . 2 February 2020 . 31 March 2013 . https://web.archive.org/web/20130331175229/http://www.alexa.com/siteinfo/github.com . dead .
- Web site: sourceforge.net Competitive Analysis, Marketing Mix and Traffic . Alexa . 2 February 2020 . 20 October 2020 . https://web.archive.org/web/20201020111244/https://www.alexa.com/siteinfo/sourceforge.net . live .
- Web site: bitbucket.org Competitive Analysis, Marketing Mix and Traffic . Alexa . 2 February 2020 . 23 June 2017 . https://web.archive.org/web/20170623162208/http://www.alexa.com/siteinfo/bitbucket.org . dead .
- Web site: gitlab.com Competitive Analysis, Marketing Mix and Traffic . Alexa . 2 February 2020 . 30 November 2017 . https://web.archive.org/web/20171130123149/https://www.alexa.com/siteinfo/gitlab.com . live .
- Web site: Git - git-gui Documentation . 2024-07-01 . Git . en.
- Web site: Git - GUI Clients . 2024-07-01 . Git . en.
- Web site: Eclipse Community Survey 2014 results | Ian Skerrett . Ianskerrett.wordpress.com . 2014-06-23 . 2014-06-23 . live . https://web.archive.org/web/20140625152145/http://ianskerrett.wordpress.com/2014/06/23/eclipse-community-survey-2014-results/ . 25 June 2014.
- Web site: Results of Eclipse Community Survey 2012 . eclipse.org. live . https://web.archive.org/web/20160411135719/http://www.eclipse.org/org/community_survey/Survey_Final_Results_2012.xls . 11 April 2016.
- Web site: Compare Repositories – Open Hub . live . https://web.archive.org/web/20140907051024/https://www.openhub.net/repositories/compare . 7 September 2014.
- Web site: Stack Overflow Annual Developer Survey . Stack Exchange, Inc. . 9 January 2020 . Stack Overflow’s annual Developer Survey is the largest and most comprehensive survey of people who code around the world. Each year, we field a survey covering everything from developers' favorite technologies to their job preferences. This year marks the ninth year we’ve published our annual Developer Survey results, and nearly 90,000 developers took the 20-minute survey earlier this year..
- Web site: Stack Overflow Developer Survey 2015 . Stack Overflow . 29 May 2019 . dead . https://web.archive.org/web/20190504144447/https://insights.stackoverflow.com/survey/2015#tech-sourcecontrol . 4 May 2019.
- Web site: Stack Overflow Developer Survey 2017 . Stack Overflow . 29 May 2019 . dead . https://web.archive.org/web/20190529004901/https://insights.stackoverflow.com/survey/2017#work-_-version-control . 29 May 2019.
- Web site: Stack Overflow Developer Survey 2018 . Stack Overflow . 29 May 2019 . dead . https://web.archive.org/web/20190530142357/https://insights.stackoverflow.com/survey/2018/#work-_-version-control . 30 May 2019.
- Web site: Stack Overflow Developer Survey 2022 . 2022-08-04 . Stack Overflow . en.
- Web site: Git (software) Jobs, Average Salary for Git Distributed Version Control System Skills . Itjobswatch.co.uk . 2016-09-30 . live . https://web.archive.org/web/20161008072321/http://www.itjobswatch.co.uk/jobs/uk/git%20(software).do . 8 October 2016.
- Web site: Team Foundation Server Jobs, Average Salary for Microsoft Team Foundation Server (TFS) Skills . Itjobswatch.co.uk . 2016-09-30 . live . https://web.archive.org/web/20161029185314/http://www.itjobswatch.co.uk/jobs/uk/team%20foundation%20server.do . 29 October 2016.
- Web site: Subversion Jobs, Average Salary for Apache Subversion (SVN) Skills . Itjobswatch.co.uk . 2016-09-30 . live . https://web.archive.org/web/20161025011418/http://www.itjobswatch.co.uk/jobs/uk/subversion.do . 25 October 2016.
- Web site: Mercurial Jobs, Average Salary for Mercurial Skills . Itjobswatch.co.uk . 2016-09-30 . live . https://web.archive.org/web/20160923081538/http://www.itjobswatch.co.uk/jobs/uk/mercurial.do . 23 September 2016.
- Web site: VSS/SourceSafe Jobs, Average Salary for Microsoft Visual SourceSafe (VSS) Skills . Itjobswatch.co.uk . 2016-09-30 . live . https://web.archive.org/web/20161029043610/http://www.itjobswatch.co.uk/jobs/uk/vss/sourcesafe.do . 29 October 2016.
- Web site: Windows switch to Git almost complete: 8,500 commits and 1,760 builds each day . Ars Technica . 24 May 2017 . 24 May 2017 . live . https://web.archive.org/web/20170524171707/https://arstechnica.com/information-technology/2017/05/90-of-windows-devs-now-using-git-creating-1760-windows-builds-per-day/ . 24 May 2017.
- Web site: git-init. Git. https://web.archive.org/web/20220315095632/https://git-scm.com/docs/git-init. March 15, 2022. live.
- Web site: Git – Branches in a Nutshell . Git . 15 June 2020 . The "master" branch in Git is not a special branch. It is exactly like any other branch. The only reason nearly every repository has one is that the git init command creates it by default and most people don’t bother to change it. . 20 December 2020 . https://web.archive.org/web/20201220123258/http://git-scm.com/book/en/v2/Git-Branching-Branches-in-a-Nutshell#ch03-git-branching . live .
- Web site: Git Revert Atlassian Git Tutorial . Atlassian . en . Reverting has two important advantages over resetting. First, it doesn’t change the project history, which makes it a "safe" operation for commits that have already been published to a shared repository..
- Web site: Gitflow Workflow Atlassian Git Tutorial . Atlassian . 15 June 2020 . en.
- Web site: Forking Workflow Atlassian Git Tutorial . Atlassian . 15 June 2020 . en.
- Web site: Git repository access control . 2016-09-06 . live . https://web.archive.org/web/20160914114143/https://wincent.com/wiki/git_repository_access_control . 14 September 2016.
- Web site: Securing your Git server against CVE-2014-9390 . 20 December 2014 . 22 December 2014 . Pettersen . Tim . live . https://web.archive.org/web/20141224012942/https://developer.atlassian.com/blog/2014/12/securing-your-git-server/ . 24 December 2014.
- [Announce] Git v2.2.1 (and updates to older maintenance tracks) ]. Hamano, J. C.. 18 December 2014. gmane.linux.kernel . https://web.archive.org/web/20141219024646/http://article.gmane.org/gmane.linux.kernel/1853266 . dead . 19 December 2014 . 22 December 2014.
- Web site: CVE-2015-7545 . 15 December 2015 . 26 December 2015 . live . https://web.archive.org/web/20151226232616/http://people.canonical.com/~ubuntu-security/cve/2015/CVE-2015-7545.html . 26 December 2015.
- Web site: Git 2.6.1 . . 26 December 2015 . 29 September 2015 . live . https://web.archive.org/web/20160411135802/https://github.com/git/git/commit/22f698cb188243b313e024d618283e0293e37140 . 11 April 2016.
- Web site: Re: CVE Request: git . Blake Burkhart . 26 December 2015 . 5 October 2015 . etal . live . https://web.archive.org/web/20151227054727/http://seclists.org/oss-sec/2015/q4/67 . 27 December 2015.
- Web site: hash – How safe are signed git tags? Only as safe as SHA-1 or somehow safer? . Information Security Stack Exchange . 22 September 2014 . live . https://web.archive.org/web/20160624232415/https://security.stackexchange.com/questions/67920/how-safe-are-signed-git-tags-only-as-safe-as-sha-1-or-somehow-safer . 24 June 2016.
- Web site: Why does Git use a cryptographic hash function? . Stack Overflow . 1 March 2015 . live . https://web.archive.org/web/20160701214638/http://stackoverflow.com/questions/28792784/why-does-git-use-a-cryptographic-hash-function . 1 July 2016.
- Web site: Git – hash-function-transition Documentation . Git.