Hard link explained

In computing, a hard link is a directory entry (in a directory-based file system) that associates a name with a file. Thus, each file must have at least one hard link. Creating additional hard links for a file makes the contents of that file accessible via additional paths (i.e., via different names or in different directories).[1] This causes an alias effect: a process can open the file by any one of its paths and change its content. By contrast, a soft link or “shortcut” to a file is not a direct link to the data itself, but rather a reference to a hard link or another soft link.

Every directory is itself a special file on many systems, containing a list of file names instead of other data. Hence, multiple hard links to directories are possible, which could create a circular directory structure, rather than a branching structure like a tree. For that reason, some file systems forbid the creation of additional hard links to directories.

POSIX-compliant operating systems, such as Linux, Android, macOS, and the Windows NT family,[2] support multiple hard links to the same file, depending on the file system. For instance, NTFS and ReFS support hard links,[3] while FAT does not.

Operation

Let two hard links, named "LINK A.TXT" and "LINK B.TXT", point to the same physical data. A text editor opens "LINK A.TXT", modifies it and saves it. When the editor (or any other app) opens "LINK B.TXT", it can see those changes made to "LINK A.TXT", since both file names point to the same data. So from a user's point of view this is one file with several filenames. Editing any filename modifies "all" files, however deleting "any" filename except the last one keeps the file around.

However, some editors, such as GNU Emacs, break the hard link concept. When opening a file for editing, e.g., "LINK B.TXT", emacs renames "LINK B.TXT" to "LINK B.TXT~", loads "LINK B.TXT~" into the editor, and saves the modified contents to a newly created "LINK B.TXT". Now, "LINK A.TXT" and "LINK B.TXT" no longer shares the same data. (This behavior can be changed using the emacs variable backup-by-copying.)

Any number of hard links to the physical data may be created. To access the data, a user only needs to specify the name of any existing link; the operating system will resolve the location of the actual data. Even if the user deletes one of the hard links, the data is still accessible through any other link that remains. Once the user deletes all of the links, if no process has the file open, the operating system frees the disk space that the file once occupied.

Reference counting

Most file systems that support hard links use reference counting. The system stores an integer value with each logical data section that represents the total number of hard links that have been created to point to the data. When a new link is created, this value is increased by one. When a link is removed, the value is decreased by one. When the counter becomes zero, the operating system frees the logical data section. (The OS may not to do so immediately, e.g., when there are outstanding file handles open, for performance reasons, or to enable the undelete command.)

This is a simple method for the file system to track the use of a given area of storage, as zero values indicate free space and nonzero values indicate used space. The maintenance of this value guarantees that there will be no dangling hard links pointing nowhere. The data section and the associated inode are preserved as long as a single hard link (directory reference) points to it or any process keeps the associated file open.

On POSIX-compliant operating systems, the reference count for a file or directory is returned by the stat or fstat system calls in the st_nlink field of struct stat.

Limitations

To prevent loops in the filesystem, and to keep the interpretation of the "" file (parent directory) consistent, operating systems do not generally allow hard links to directories. UNIX System V allowed them, but only the superuser had permission to make such links.[4] Mac OS X v10.5 (Leopard) and newer use hard links on directories for the Time Machine backup mechanism only.[5]

Hard links can be created to files only on the same volume, i.e., within the same file system. (Different volumes may have different file systems. There is no guarantee that the target volume's file system is compatible with hard linking.)

The maximum number of hard links to a single file is limited by the size of the reference counter. On Unix-like systems the counter is 4,294,967,295 (on 32-bit machines) or 18,446,744,073,709,551,615 (on 64-bit machines). In some file systems, the number of hard links is limited more strictly by their on-disk format. For example, as of Linux 3.11, the ext4 file system limits the number of hard links on a file to 65,000.[6] Windows limits enforces a limit of 1024 hard links to a file on NTFS volumes.[7]

On Linux Weekly News, Neil Brown criticized hard links as high-maintenance, since they complicate the design of programs that handle directory trees, including archivers and disk usage tools. These apps must take care to de-duplicate files that are linked multiple times in a hierarchy. Brown notes that Plan 9 from Bell Labs, the intended successor to Unix, does not include the concept of a hard link.[8]

Platform support

Windows NT 3.1 and later support hard links on the NTFS file system.[9] Windows 2000 introduces a CreateHardLink function to create hard links, but only for files, not directories.[10] The DeleteFile function can remove them.

To create a hard link on Windows, end-users can use:

To interrogate a file for its hard links, end-users can use:

The Windows Component Store uses hard links to keep track of different versions of components stored on the hard disk drive.

On Unix-like systems, the system call can create additional hard links to existing files. To create hard links, end-users can use:

To interrogate a file for its hard links, end-users can use:

Unix-like emulation or compatibility software running on Microsoft Windows, such as Cygwin and Subsystem for UNIX-based Applications, allow the use of POSIX interfaces.

OpenVMS supports hard links on the ODS-5 file system.[15] Unlike Unix, VMS can create hard links to directories.

See also

Details the four link types that the NTFS supports—hard links, symbolic links, junction points, and volume mount points

macOS implementation of a shortcut

Notes and References

  1. Web site: Pitcher. Lew. Q & A: The difference between hard and soft links.
  2. Web site: Link Shell Extension.
  3. Web site: Resilient File System (ReFS) overview . Microsoft Learn . . 26 October 2022.
  4. Book: Bach, Maurice J. . 1986 . The Design of the UNIX Operating System . registration . Prentice Hall . 128 . 9780132017992 .
  5. Web site: Pond . James . How Time Machine Works its Magic . May 19, 2019 . August 31, 2013 . File System Event Store, Hard Links . June 21, 2019 . https://web.archive.org/web/20190621092705/https://www.baligu.com/pondini/TM/Works.html . dead .
  6. Web site: Linux kernel source tree, fs/ext4/ext4.h, line 229.
  7. Web site: CreateHardLinkA function (winbase.h) . Windows App Development . . 13 October 2021.
  8. Web site: Ghosts of Unix past, part 4: High-maintenance designs . Linux Weekly News . 23 November 2010 . 20 April 2014 . Neil . Brown.
  9. Web site: How hard links work. Microsoft Docs. 6 January 2011 .
  10. Web site: 10 March 2011. CreateHardLink Function. usurped. https://web.archive.org/web/20110702043053/http://msdn.microsoft.com/en-us/library/Aa363860. 2 July 2011. Windows Development. Microsoft. MSDN. Establishes a hard link between an existing file and a new file. This function is only supported on the NTFS file system, and only for files, not directories..
  11. Web site: 18 April 2012. Fsutil hardlink. Windows App Development. Microsoft. Microsoft Docs.
  12. Web site: Mklink . Microsoft Docs . . 18 April 2012.
  13. Web site: New-Item (PowerShell 3.0) . Microsoft Docs . . 22 June 2020 . If your location is in a FileSystem drive, the following values are allowed: If your location is in a FileSystem drive, the following values are allowed: File[,] Directory[,] Junction[,] HardLink.
  14. Web site: 20 November 2021. FileSystemProvider.cs. PowerShell / PowerShell repo. Microsoft. Lines 8139–8234. GitHub.
  15. Web site: OpenVMS System Manager's Manual, Vol. I. 2021-01-23. VSI. August 2019.