File descriptor explained

In Unix and Unix-like computer operating systems, a file descriptor (FD, less frequently fildes) is a process-unique identifier (handle) for a file or other input/output resource, such as a pipe or network socket.

File descriptors typically have non-negative integer values, with negative values being reserved to indicate "no value" or error conditions.

File descriptors are a part of the POSIX API. Each Unix process (except perhaps daemons) should have three standard POSIX file descriptors, corresponding to the three standard streams:

Integer valueName symbolic constant[1] file stream[2]
0Standard input
1Standard output
2Standard error

Overview

In the traditional implementation of Unix, file descriptors index into a per-process maintained by the kernel, that in turn indexes into a system-wide table of files opened by all processes, called the . This table records the mode with which the file (or other resource) has been opened: for reading, writing, appending, and possibly other modes. It also indexes into a third table called the inode table that describes the actual underlying files.[3] To perform input or output, the process passes the file descriptor to the kernel through a system call, and the kernel will access the file on behalf of the process. The process does not have direct access to the file or inode tables.

On Linux, the set of file descriptors open in a process can be accessed under the path /proc/PID/fd/, where PID is the process identifier. File descriptor /proc/PID/fd/0 is stdin, /proc/PID/fd/1 is stdout, and /proc/PID/fd/2 is stderr. As a shortcut to these, any running process can also access its own file descriptors through the folders /proc/self/fd and /dev/fd.[4]

In Unix-like systems, file descriptors can refer to any Unix file type named in a file system. As well as regular files, this includes directories, block and character devices (also called "special files"), Unix domain sockets, and named pipes. File descriptors can also refer to other objects that do not normally exist in the file system, such as anonymous pipes and network sockets.

The FILE data structure in the C standard I/O library usually includes a low level file descriptor for the object in question on Unix-like systems. The overall data structure provides additional abstraction and is instead known as a file handle.

Operations on file descriptors

The following lists typical operations on file descriptors on modern Unix-like systems. Most of these functions are declared in the <unistd.h> header, but some are in the <fcntl.h> header instead.

Creating file descriptors

Operations on a single file descriptor

Operations on multiple file descriptors

Operations on the file descriptor table

The function is used to perform various operations on a file descriptor, depending on the command argument passed to it. There are commands to get and set attributes associated with a file descriptor, including and .

Operations that modify process state

File locking

Sockets

See also: Berkeley sockets.

Miscellaneous

Upcoming operations

A series of new operations on file descriptors has been added to many modern Unix-like systems, as well as numerous C libraries, to be standardized in a future version of POSIX.[7] The at suffix signifies that the function takes an additional first argument supplying a file descriptor from which relative paths are resolved, the forms lacking the at suffix thus becoming equivalent to passing a file descriptor corresponding to the current working directory. The purpose of these new operations is to defend against a certain class of TOCTOU attacks.

File descriptors as capabilities

Unix file descriptors behave in many ways as capabilities. They can be passed between processes across Unix domain sockets using the [[sendmsg]] system call. Note, however, that what is actually passed is a reference to an "open file description" that has mutable state (the file offset, and the file status and access flags). This complicates the secure use of file descriptors as capabilities, since when programs share access to the same open file description, they can interfere with each other's use of it by changing its offset or whether it is blocking or non-blocking, for example.[8] [9] In operating systems that are specifically designed as capability systems, there is very rarely any mutable state associated with a capability itself.

A Unix process' file descriptor table is an example of a C-list.

See also

Notes and References

  1. Web site: The Open Group Base Specifications Issue 7, IEEE Std 1003.1-2008, 2016 Edition . The Open Group . 2017-09-21 . The Open Group .
  2. Web site: The Open Group Base Specifications Issue 7, IEEE Std 1003.1-2008, 2016 Edition . . The Open Group . 2017-09-21 .
  3. Book: Bach , Maurice J. . The Design of the UNIX Operating System. 1986. 8. Prentice-Hall. 92–96. 9780132017992. registration.
  4. Web site: Devices - What does the output of 'll /Proc/Self/Fd/' (From 'll /Dev/Fd') mean?.
  5. Web site: The Open Group Base Specifications Issue 7, IEEE Std 1003.1-2008, 2018 Edition – creat . The Open Group . 2019-04-11 . The Open Group .
  6. Web site: close_range(2) — Linux manual page . Stephen Kitt, Michael Kerrisk . 2021-03-22 .
  7. Book: Extended API Set, Part 2 . The Open Group . October 2006 . registration . 1931624674.
  8. Web site: Building a bridge: library API's and file descriptors?. Marcus. Brinkmann. 2009-02-04. cap-talk. 2017-09-21. https://archive.today/20120730163337/http://www.eros-os.org/pipermail/cap-talk/2009-February/012137.html. 2012-07-30. dead.
  9. Web site: Don't set shared file descriptors to non-blocking I/O mode.. Jonathan. de Boyne Pollard. 2007. 2017-09-21.