Non-breaking space explained

Non-breaking space
See Also:

Other types of spaces

In word processing and digital typesetting, a non-breaking space, also called NBSP, required space,[1] hard space, or fixed space (in most typefaces, it is not of fixed width), is a space character that prevents an automatic line break at its position. In some formats, including HTML, it also prevents consecutive whitespace characters from collapsing into a single space. Non-breaking space characters with other widths also exist.

Uses and variations

Despite having layout and uses similar to those of whitespace, it differs in contextual behavior.[2] [3]

Non-breaking behavior

Non-collapsing behavior

A second common application of non-breaking spaces is in plain text file formats such as SGML, HTML, TeX and LaTeX, whose rendering engines are programmed to treat sequences of whitespace characters (space, newline, tab, form feed, etc.) as if they were a single character (but this behavior can be overridden). Such "collapsing" of whitespace allows the author to neatly arrange the source text using line breaks, indentation and other forms of spacing without affecting the final typeset result.[4] [5]

In contrast, non-breaking spaces are not merged with neighboring whitespace characters when displayed. They can, therefore, be used by an author to simply insert additional visible space in the resulting output without using spans styled with peculiar values of the CSS "white-space" property. Conversely, indiscriminate use (see the recommended use in style guides), in addition to a normal space, gives extraneous space in the output.

Width variation

Other non-breaking variants, defined in Unicode:

Due to the tighter binding of value and unit as a continuous visual element NNBSP is recommended for usage in the SI-standard.[6] It was introduced in Unicode 3.0 for Mongolian,[7] to separate a suffix[8] from the word stem without indicating a word boundary. It is also required for big punctuation (sometimes inaccurately referred to as "double punctuation") in French where it is called French: espace fine insécable (before ;, ?, !, », and after «, ; today often (i.e. in French DTP, referred to[9] as new-school) also before :) and in German between multi-part abbreviations (e.g., "z.B.", "d.h.", "v.l.n.r.").[10] When used with Mongolian, its width is usually one third of the normal space; in other contexts, its width is about 70% of the normal space but may resemble that of the thin space (U+2009), at least with some fonts.[11] Also starting from release 34 of Unicode Common Locale Data Repository (CLDR) the NNBSP is used in numbers as thousands group separator for French and Spanish locale.[12] [13]
Produces a space equal to the figure (0 - 9) characters.
Encoded in Unicode since version 3.2. The word joiner does not produce any space and prohibits a line break at its position.

Example

On browsers, resizing the window will demonstrate the effect of non-breaking spaces on the texts below.

To show the non-breaking effect of the non-breaking space, the following words have been separated with non-breaking spaces:

Lorem Ipsum Dolor Sit Amet Consectetur Adipiscing Elit Sed Do Eiusmod Tempor Incididunt Ut Labore Et Dolore Magna Aliqua Ut Enim Ad Minim Veniam Quis Nostrud Exercitation Ullamco Laboris Nisi Ut Aliquip Ex Ea Commodo Consequat Duis Aute

To show the non-collapsing behavior of the non-breaking space, the following words have been separated with an increasing number of non-breaking spaces:

Lorem Ipsum  Dolor   Sit    Amet     Consectetur      Adipiscing       Elit        Sed         Do          Eiusmod           Tempor            Incididunt             Ut              Labore               Et                Dolore                 Magna                  Aliqua                   Ut                    Enim                     Ad                      Minim

In contrast, the following words are separated with ordinary spaces:

Lorem Ipsum Dolor Sit Amet Consectetur Adipiscing Elit Sed Do Eiusmod Tempor Incididunt Ut Labore Et Dolore Magna Aliqua Ut Enim Ad Minim Veniam Quis Nostrud Exercitation Ullamco Laboris Nisi Ut Aliquip Ex Ea Commodo Consequat Duis Aute

Encodings

In Unicode, the byte order mark (BOM), U+FEFF, may be interpreted as a "zero width no-break space", but is a deprecated alternative to word joiner (U+2060).

Keyboard entry methods

It is rare for national or international standards on keyboard layouts to define an input method for the non-breaking space. An exception is the Finnish multilingual keyboard, accepted as the national standard SFS 5966 in 2008. According to SFS 5966, the non-breaking space can be entered with the key combination AltGr + Space.[14]

Typically, authors of keyboard drivers and application programs (e.g., word processors) have devised their own keyboard shortcuts for the non-breaking space. For example:

System/applicationEntry method
Microsoft Windows+ or + (does not always work)
macOS
Linux or Unix using X11,, or
AmigaOS
GNU Emacs
Vim,, ; or,,
Dreamweaver, LibreOffice, Microsoft Word,
OpenOffice.org (since 3.0), AutoCAD
FrameMaker, LyX (non-Mac), OpenOffice.org (before 3.0),
WordPerfect
Mac Adobe InDesign
Python programming language'\N{NO-BREAK SPACE}'[15] or '\xa0'/'\u00a0'

Apart from this, applications and environments often have methods of entering unicode entities directly via their code point, e.g., via the Alt Numpad input method. (Non-breaking space has code point 255 decimal (FF hex) in codepage 437 and codepage 850 and code point 160 decimal (A0 hex) in codepage 1252.)

See also

Notes and References

  1. Web site: Windows, Latin 1 . CPGID 01252 . REGISTRY, Graphic Character Sets and Code Pages . IBM . IBM . 1998 . 1995.
  2. 10.3998/3336451.0013.105. Justify Just or Just Justify. The Journal of Electronic Publishing. 13. 2010. Elyaakoubi. Mohamed. Lazrek. Azzeddine. free. 2027/spo.3336451.0013.105. free.
  3. Web site: Special Characters . The Chicago Manual of Style Online.
  4. .
  5. .
  6. Web site: SI-Standard mentioning the insertion of non-breaking space between value and unit .
  7. ISO/IEC 10646-1:1993/FDAM 29:1999(E)
  8. Web site: Mongolian NNBSP-connected suffixes. December 28, 2014. W3.
  9. Web site: Typo : caractères capitales accentués . claude72 . 2005-04-25 . Forum App Store et applications > Applications Mac > Création graphique . 5 . MacGeneration . 2019-03-30.
  10. Web site: Zweisprachige Mikrotypografie . Solbrig . Amelie . 30 January 2008 . Hochschule für Technik, Wirtschaft und Kultur Leipzig . 10 June 2018 . Alle Abkürzungen mit Binnenpunkten werden im Deutschen mit einem gFL [geschütztes flexibles Leerzeichen] spationiert. [...] Die englische Schreibweise sieht keine Abstände zwischen einzelnen Buchstaben vor. Nach einem Binnenpunkt folgt demnach ohne gFL sofort der nächste Buchstabe. . de. https://web.archive.org/web/20160311073646/http://www.verlagsherstellung.de/fileadmin/fbmedien_bmp/downloads/Abschlussarbeiten/Zweisprachige_Mikrotypografie_Amelie_Solbrig_VH-02.pdf . 2016-03-11 . 58 (PDF p. 113) .
  11. Web site: Writing Systems and Punctuation . The Unicode Standard 7.0 . . 2014 . 2014-11-02.
  12. Web site: CLDR Chart: Numbers .
  13. Web site: RAE. RAE. punto Diccionario panhispánico de dudas. 2022-01-07. «Diccionario panhispánico de dudas». es.
  14. . Drafts of the Finnish multilingual keyboard.
  15. Web site: Python Software Foundation . Unicode HOWTO¶ . Python 3.10.6 documentation . 19 August 2022.