GeForce | |
Manufacturer: | |
Designfirm: | Nvidia |
Marketed By: | Nvidia |
Numcores: | Up to 16,384 CUDA cores |
Process: | 220 nm to 3 nm |
Predecessor: | RIVA TNT2 |
Variant: | Nvidia Quadro, Nvidia Tesla |
GeForce is a brand of graphics processing units (GPUs) designed by Nvidia and marketed for the performance market. As of the GeForce 40 series, there have been eighteen iterations of the design. The first GeForce products were discrete GPUs designed for add-on graphics boards, intended for the high-margin PC gaming market, and later diversification of the product line covered all tiers of the PC graphics market, ranging from cost-sensitive GPUs integrated on motherboards, to mainstream add-in retail boards. Most recently, GeForce technology has been introduced into Nvidia's line of embedded application processors, designed for electronic handhelds and mobile handsets.
With respect to discrete GPUs, found in add-in graphics-boards, Nvidia's GeForce and AMD's Radeon GPUs are the only remaining competitors in the high-end market. GeForce GPUs are very dominant in the general-purpose graphics processor unit (GPGPU) market thanks to their proprietary Compute Unified Device Architecture (CUDA).[1] GPGPU is expected to expand GPU functionality beyond the traditional rasterization of 3D graphics, to turn it into a high-performance computing device able to execute arbitrary programming code in the same way a CPU does, but with different strengths (highly parallel execution of straightforward calculations) and weaknesses (worse performance for complex branching code).
The "GeForce" name originated from a contest held by Nvidia in early 1999 called "Name That Chip". The company called out to the public to name the successor to the RIVA TNT2 line of graphics boards. There were over 12,000 entries received and seven winners received a RIVA TNT2 Ultra graphics card as a reward.[2] [3] Brian Burke, senior PR manager at Nvidia, told Maximum PC in 2002 that "GeForce" originally stood for "Geometry Force" since GeForce 256 was the first GPU for personal computers to calculate the transform-and-lighting geometry, offloading that function from the CPU.[4]
See main article: GeForce 256.
See main article: GeForce 2 series.
Launched in march 2000, the first GeForce2 (NV15) was another high-performance graphics chip. Nvidia moved to a twin texture processor per pipeline (4x2) design, doubling texture fillrate per clock compared to GeForce 256. Later, Nvidia released the GeForce2 MX (NV11), which offered performance similar to the GeForce 256 but at a fraction of the cost. The MX was a compelling value in the low/mid-range market segments and was popular with OEM PC manufacturers and users alike. The GeForce 2 Ultra was the high-end model in this series.
See main article: GeForce 3 series.
Launched in February 2001, the GeForce3 (NV20) introduced programmable vertex and pixel shaders to the GeForce family and to consumer-level graphics accelerators. It had good overall performance and shader support, making it popular with enthusiasts although it never hit the midrange price point. The NV2A developed for the Microsoft Xbox game console is a derivative of the GeForce 3.
See main article: GeForce 4 series.
Launched in February 2002, the then-high-end GeForce4 Ti (NV25) was mostly a refinement to the GeForce3. The biggest advancements included enhancements to anti-aliasing capabilities, an improved memory controller, a second vertex shader, and a manufacturing process size reduction to increase clock speeds. Another member of the GeForce 4 family, the budget GeForce4 MX was based on the GeForce2, with the addition of some features from the GeForce4 Ti. It targeted the value segment of the market and lacked pixel shaders. Most of these models used the AGP 4× interface, but a few began the transition to AGP 8×.
See main article: GeForce FX series.
Launched in 2003, the GeForce FX (NV30) was a huge change in architecture compared to its predecessors. The GPU was designed not only to support the new Shader Model 2 specification but also to perform well on older titles. However, initial models like the GeForce FX 5800 Ultra suffered from weak floating point shader performance and excessive heat which required infamously noisy two-slot cooling solutions. Products in this series carry the 5000 model number, as it is the fifth generation of the GeForce, though Nvidia marketed the cards as GeForce FX instead of GeForce 5 to show off "the dawn of cinematic rendering".
See main article: GeForce 6 series.
Launched in April 2004, the GeForce 6 (NV40) added Shader Model 3.0 support to the GeForce family, while correcting the weak floating point shader performance of its predecessor. It also implemented high-dynamic-range imaging and introduced SLI (Scalable Link Interface) and PureVideo capability (integrated partial hardware MPEG-2, VC-1, Windows Media Video, and H.264 decoding and fully accelerated video post-processing).
See main article: GeForce 7 series.
The seventh generation GeForce (G70/NV47) was launched in June 2005 and was the last Nvidia video card series that could support the AGP bus. The design was a refined version of GeForce 6, with the major improvements being a widened pipeline and an increase in clock speed. The GeForce 7 also offers new transparency supersampling and transparency multisampling anti-aliasing modes (TSAA and TMAA). These new anti-aliasing modes were later enabled for the GeForce 6 series as well. The GeForce 7950GT featured the highest performance GPU with an AGP interface in the Nvidia line. This era began the transition to the PCI-Express interface.
A 128-bit, 8 ROP variant of the 7800 GTX, called the RSX Reality Synthesizer, is used as the main GPU in the Sony PlayStation 3.
See main article: GeForce 8 series.
Released on November 8, 2006, the eighth-generation GeForce (originally called G80) was the first ever GPU to fully support Direct3D 10. Manufactured using a 90 nm process and built around the new Tesla microarchitecture, it implemented the unified shader model. Initially just the 8800GTX model was launched, while the GTS variant was released months into the product line's life, and it took nearly six months for mid-range and OEM/mainstream cards to be integrated into the 8 series. The die shrink down to 65 nm and a revision to the G80 design, codenamed G92, were implemented into the 8 series with the 8800GS, 8800GT and 8800GTS-512, first released on October 29, 2007, almost one whole year after the initial G80 release.
See main article: GeForce 9 series and GeForce 100 series.
The first product was released on February 21, 2008.[5] Not even four months older than the initial G92 release, all 9-series designs are simply revisions to existing late 8-series products. The 9800GX2 uses two G92 GPUs, as used in later 8800 cards, in a dual PCB configuration while still only requiring a single PCI-Express 16x slot. The 9800GX2 utilizes two separate 256-bit memory busses, one for each GPU and its respective 512 MB of memory, which equates to an overall of 1 GB of memory on the card (although the SLI configuration of the chips necessitates mirroring the frame buffer between the two chips, thus effectively halving the memory performance of a 256-bit/512 MB configuration). The later 9800GTX features a single G92 GPU, 256-bit data bus, and 512 MB of GDDR3 memory.[6]
Prior to the release, no concrete information was known except that the officials claimed the next generation products had close to 1 TFLOPS processing power with the GPU cores still being manufactured in the 65 nm process, and reports about Nvidia downplaying the significance of Direct3D 10.1.[7] In March 2009, several sources reported that Nvidia had quietly launched a new series of GeForce products, namely the GeForce 100 Series, which consists of rebadged 9 Series parts.[8] [9] [10] GeForce 100 series products were not available for individual purchase.
See main article: GeForce 200 series and GeForce 300 series.
Based on the GT200 graphics processor consisting of 1.4 billion transistors, codenamed Tesla, the 200 series was launched on June 16, 2008.[11] The next generation of the GeForce series takes the card-naming scheme in a new direction, by replacing the series number (such as 8800 for 8-series cards) with the GTX or GTS suffix (which used to go at the end of card names, denoting their 'rank' among other similar models), and then adding model-numbers such as 260 and 280 after that. The series features the new GT200 core on a 65nm die.[12] The first products were the GeForce GTX 260 and the more expensive GeForce GTX 280.[13] The GeForce 310 was released on November 27, 2009, which is a rebrand of GeForce 210.[14] [15] The 300 series cards are rebranded DirectX 10.1 compatible GPUs from the 200 series, which were not available for individual purchase.
See main article: GeForce 400 series and GeForce 500 series.
On April 7, 2010, Nvidia released[16] the GeForce GTX 470 and GTX 480, the first cards based on the new Fermi architecture, codenamed GF100; they were the first Nvidia GPUs to utilize 1 GB or more of GDDR5 memory. The GTX 470 and GTX 480 were heavily criticized due to high power use, high temperatures, and very loud noise that were not balanced by the performance offered, even though the GTX 480 was the fastest DirectX 11 card as of its introduction.
In November 2010, Nvidia released a new flagship GPU based on an enhanced GF100 architecture (GF110) called the GTX 580. It featured higher performance, less power utilization, heat and noise than the preceding GTX 480. This GPU received much better reviews than the GTX 480. Nvidia later also released the GTX 590, which packs two GF110 GPUs on a single card.
See main article: GeForce 600 series, GeForce 700 series and GeForce 800M series.
In September 2010, Nvidia announced that the successor to Fermi microarchitecture would be the Kepler microarchitecture, manufactured with the TSMC 28 nm fabrication process. Earlier, Nvidia had been contracted to supply their top-end GK110 cores for use in Oak Ridge National Laboratory's "Titan" supercomputer, leading to a shortage of GK110 cores. After AMD launched their own annual refresh in early 2012, the Radeon HD 7000 series, Nvidia began the release of the GeForce 600 series in March 2012. The GK104 core, originally intended for their mid-range segment of their lineup, became the flagship GTX 680. It introduced significant improvements in performance, heat, and power efficiency compared to the Fermi architecture and closely matched AMD's flagship Radeon HD 7970. It was quickly followed by the dual-GK104 GTX 690 and the GTX 670, which featured only a slightly cut-down GK104 core and was very close in performance to the GTX 680.
With the GTX Titan, Nvidia also released GPU Boost 2.0, which would allow the GPU clock speed to increase indefinitely until a user-set temperature limit was reached without passing a user-specified maximum fan speed. The final GeForce 600 series release was the GTX 650 Ti BOOST based on the GK106 core, in response to AMD's Radeon HD 7790 release. At the end of May 2013, Nvidia announced the 700 series, which was still based on the Kepler architecture, however it featured a GK110-based card at the top of the lineup. The GTX 780 was a slightly cut-down Titan that achieved nearly the same performance for two-thirds of the price. It featured the same advanced reference cooler design, but did not have the unlocked double-precision cores and was equipped with 3 GB of memory.
At the same time, Nvidia announced ShadowPlay, a screen capture solution that used an integrated H.264 encoder built into the Kepler architecture that Nvidia had not revealed previously. It could be used to record gameplay without a capture card, and with negligible performance decrease compared to software recording solutions, and was available even on the previous generation GeForce 600 series cards. The software beta for ShadowPlay, however, experienced multiple delays and would not be released until the end of October 2013. A week after the release of the GTX 780, Nvidia announced the GTX 770 to be a rebrand of the GTX 680. It was followed by the GTX 760 shortly after, which was also based on the GK104 core and similar to the GTX 660 Ti. No more 700 series cards were set for release in 2013, although Nvidia announced G-Sync, another feature of the Kepler architecture that Nvidia had left unmentioned, which allowed the GPU to dynamically control the refresh rate of G-Sync-compatible monitors which would release in 2014, to combat tearing and judder. However, in October, AMD released the R9 290X, which came in at $100 less than the GTX 780. In response, Nvidia slashed the price of the GTX 780 by $150 and released the GTX 780 Ti, which featured a full 2880-core GK110 core even more powerful than the GTX Titan, along with enhancements to the power delivery system which improved overclocking, and managed to pull ahead of AMD's new release.
The GeForce 800M series consists of rebranded 700M series parts based on the Kepler architecture and some lower-end parts based on the newer Maxwell architecture.
See main article: GeForce 900 series.
In March 2013, Nvidia announced that the successor to Kepler would be the Maxwell microarchitecture. It was released in September 2014, with the GM10x series chips, emphasizing the new power efficiency architectural improvements in OEM, and low TDP products in desktop GTX 750/750 ti, and mobile GTX 850M/860M. Later that year Nvidia pushed the TDP with the GM20x chips for power users, skipping the 800 series for desktop entirely, with the 900 series of GPUs.
This was the last GeForce series to support analog video output through DVI-I. Although, analog display adapters exist and are able to convert a digital Display Port, HDMI, or DVI-D (Digital).
See main article: GeForce 10 series. In March 2014, Nvidia announced that the successor to Maxwell would be the Pascal microarchitecture; announced on May 6, 2016, and were released several weeks later on May 27 and June 10, respectively. Architectural improvements include the following:[17] [18]
See main article: GeForce 20 series and GeForce 16 series. In August 2018, Nvidia announced the GeForce successor to Pascal. The new microarchitecture name was revealed as "Turing" at the Siggraph 2018 conference.[24] This new GPU microarchitecture is aimed to accelerate the real-time ray tracing support and AI Inferencing. It features a new Ray Tracing unit (RT Core) which can dedicate processors to the ray tracing in hardware. It supports the DXR extension in Microsoft DirectX 12. Nvidia claims the new architecture is up to 6 times faster than the older Pascal architecture.[25] [26] A whole new Tensor core design since Volta introduces AI deep learning acceleration, which allows the utilisation of DLSS (Deep Learning Super Sampling), a new form of anti-aliasing that uses AI to provide crisper imagery with less impact on performance.[27] It also changes its integer execution unit which can execute in parallel with the floating point data path. A new unified cache architecture which doubles its bandwidth compared with previous generations was also announced.[28]
The new GPUs were revealed as the Quadro RTX 8000, Quadro RTX 6000 and Quadro RTX 5000. The high end Quadro RTX 8000 features 4,608 CUDA cores and 576 Tensor cores with 48 GB of VRAM. Later during the Gamescom press conference, Nvidia's CEO Jensen Huang, unveiled the new GeForce RTX series with RTX 2080 Ti, 2080, and 2070 that will use the Turing architecture. The first Turing cards were slated to ship to consumers on September 20, 2018.[29] Nvidia announced the RTX 2060 on January 6, 2019, at CES 2019.[30]
On July 2, 2019, Nvidia announced the GeForce RTX Super line of cards, a 20 series refresh which comprises higher-spec versions of the RTX 2060, 2070 and 2080. The RTX 2070 and 2080 were discontinued.
In February 2019, Nvidia announced the GeForce 16 series. It is based on the same Turing architecture used in the GeForce 20 series, but disabling the Tensor (AI) and RT (ray tracing) cores to provide more affordable graphic cards for gamers while still attaining a higher performance compared to respective cards of the previous GeForce generations.
Like the RTX Super refresh, Nvidia on October 29, 2019, announced the GTX 1650 Super and 1660 Super cards, which replaced their non-Super counterparts.
On June 28, 2022, Nvidia quietly released their GTX 1630 card, which was meant for low-end gamers.
See main article: GeForce 30 series. Nvidia officially announced at the GeForce Special Event that the successor to GeForce 20 series will be the 30 series, it is built on the Ampere microarchitecture. The GeForce Special Event introduced took place on September 1, 2020, and set September 17th as the official release date for the RTX 3080 GPU, September 24th for the RTX 3090 GPU and October 29th for the RTX 3070 GPU.[31] [32] With the latest GPU launch being the RTX 3090 Ti. The RTX 3090 Ti is the highest-end Nvidia GPU on the Ampere microarchitecture, it features a fully unlocked GA102 die built on the Samsung 8 nm node due to supply shortages with TSMC. The RTX 3090 Ti has 10,752 CUDA cores, 336 Tensor cores and texture mapping units, 112 ROPs, 84 RT cores, and 24 gigabytes of GDDR6X memory with a 384-bit bus.[33] When compared to the RTX 2080 Ti, the 3090 Ti has 6,400 more CUDA cores. Due to the global chip shortage, the 30 series was controversial as scalpers and high demand meant that GPU prices skyrocketed for the 30 series and the AMD RX 6000 series.
See main article: GeForce 40 series.
On September 20, 2022, Nvidia announced its GeForce 40 Series graphics cards.[34] These came out as the RTX 4090, on October 12, 2022, the RTX 4080, on November 16, 2022, the RTX 4070 Ti, on January 3, 2023, with the RTX 4070, on April 13, 2023, and the RTX 4060 Ti on May 24, 2023, and the RTX 4060, following in July of this year. More 40-series are due in 2024, such as the RTX 4050. These are built on the Ada Lovelace architecture, with current part numbers being, "AD102", "AD103", "AD104" "AD106" and "AD107". These parts are manufactured using the TSMC N4 process node which is a custom-designed process for Nvidia. The RTX 4090 is currently the fastest chip for the mainstream market that has been released by a major company, consisting of around 16,384 CUDA cores, boost clocks of 2.2 / 2.5 GHz, 24 GB of GDDR6X, a 384-bit memory bus, 128 3rd gen RT cores, 512 4th gen Tensor cores, DLSS 3.0 and a TDP of 450W.[35]
Since the GeForce 2 series, Nvidia has produced a number of graphics chipsets for notebook computers under the GeForce Go branding. Most of the features present in the desktop counterparts are present in the mobile ones. These GPUs are generally optimized for lower power consumption and less heat output in order to be used in notebook PCs and small desktops.
Beginning with the GeForce 8 series, the GeForce Go brand was discontinued and the mobile GPUs were integrated with the main line of GeForce GPUs, but their name suffixed with an M. This ended in 2016 with the launch of the laptop GeForce 10 series – Nvidia dropped the M suffix, opting to unify the branding between their desktop and laptop GPU offerings, as notebook Pascal GPUs are almost as powerful as their desktop counterparts (something Nvidia tested with their "desktop-class" notebook GTX 980 GPU back in 2015).[36]
The GeForce MX brand, previously used by Nvidia for their entry-level desktop GPUs, was revived in 2017 with the release of the GeForce MX150 for notebooks.[37] The MX150 is based on the same Pascal GP108 GPU as used on the desktop GT 1030,[38] and was quietly released in June 2017.[37]
Similar to the mobile GPUs, Nvidia also released a few GPUs in "small form factor" format, for use in all-in-one desktops. These GPUs are suffixed with an S, similar to the M used for mobile products.[39]
Beginning with the nForce 4, Nvidia started including onboard graphics solutions in their motherboard chipsets. These were called mGPUs (motherboard GPUs).[40] Nvidia discontinued the nForce range, including these mGPUs, in 2009.[41]
After the nForce range was discontinued, Nvidia released their Ion line in 2009, which consisted of an Intel Atom CPU partnered with a low-end GeForce 9 series GPU, fixed on the motherboard. Nvidia released an upgraded Ion 2 in 2010, this time containing a low-end GeForce 300 series GPU.
From the GeForce 4 series until the GeForce 9 series, the naming scheme below is used.
Category of graphics card | Number range | Suffix[42] | Price range[43] (USD) | Shader amount[44] | Memory | Example products | |||
---|---|---|---|---|---|---|---|---|---|
Type | Bus width | Size | |||||||
Entry-level | 000–550 | SE, LE, no suffix, GS, GT, Ultra | < $100 | < 25% | DDR, DDR2 | 25–50% | ~25% | GeForce 9400 GT, GeForce 9500 GT | |
Mid-range | 600–750 | VE, LE, XT, no suffix, GS, GSO, GT, GTS, Ultra | $100–175 | 25–50% | DDR2, GDDR3 | 50–75% | 50–75% | GeForce 9600 GT, GeForce 9600 GSO | |
High-end | 800–950 | VE, LE, ZT, XT, no suffix, GS, GSO, GT, GTO, GTS, GTX, GTX+, Ultra, Ultra Extreme, GX2 | > $175 | 50–100% | GDDR3 | 75–100% | 50–100% | GeForce 9800 GT, GeForce 9800 GTX |
Since the release of the GeForce 100 series of GPUs, Nvidia changed their product naming scheme to the one below.[45]
Category of graphics card | Prefix | Number range (last 2 digits) | Price range (USD) | Shader amount | Memory | Example products | |||
---|---|---|---|---|---|---|---|---|---|
Type | Bus width | Size | |||||||
Entry-level | no prefix, G, GT, GTX[46] | 00–45 | < $100 | < 25% | DDR2, DDR3, GDDR3, DDR4, GDDR5, GDDR6 | 25–50% | ~25% | GeForce GT 420, GeForce GT 1010, GeForce GTX 1630 | |
Mid-range | GTS, GTX, RTX | 50–65 | $100–300 | 25–50% | GDDR3, GDDR5, GDDR5X, GDDR6 | 50–75% | 50–100% | GeForce GTS 450, GeForce GTX 960, GeForce RTX 3050 | |
High-end | GTX, RTX | 70–95 | > $300 | 50–100% | GDDR3, GDDR5, GDDR5X, GDDR6, GDDR6X | 75–100% | 75–100% | GeForce GTX 295, GeForce GTX 1070 Ti, GeForce RTX 2080 Ti |
Basic support for the DRM mode-setting interface in the form of a new kernel module named nvidia-modeset.ko
has been available since version 358.09 beta.[52] The support of Nvidia's display controller on the supported GPUs is centralized in nvidia-modeset.ko
. Traditional display interactions (X11 modesets, OpenGL SwapBuffers, VDPAU presentation, SLI, stereo, framelock, G-Sync, etc.) initiate from the various user-mode driver components and flow to nvidia-modeset.ko
.[53]
In May 2022, Nvidia announced that it would release a partially open-source driver for the (GSP enabled) Turing architecture and newer, in order to enhance the ability for it to be packaged as part of Linux distributions. At launch Nvidia considered the driver to be alpha quality for consumer GPUs, and production ready for datacenter GPUs. Currently the userspace components of the driver (including OpenGL, Vulkan, and CUDA) remain proprietary. In addition, the open-source components of the driver are only a wrapper (CPU-RM) for the GPU System Processor (GSP) firmware, a RISC-V binary blob that is now required for running the open-source driver.[54] [55] The GPU System Processor is a RISC-V coprocessor codenamed "Falcon" that is used to offload GPU initialization and management tasks. The driver itself is still split for the host CPU portion (CPU-RM) and the GSP portion (GSP-RM).[56] Windows 11 and Linux propriatery drivers also support enabling GSP and make even gaming faster.[57] [58] CUDA supports GSP since version 11.6.[59] Upcoming Linux kernel 6.7 will support GSP in Nouveau.[60] [61]
Community-created, free and open-source drivers exist as an alternative to the drivers released by Nvidia. Open-source drivers are developed primarily for Linux, however there may be ports to other operating systems. The most prominent alternative driver is the reverse-engineered free and open-source nouveau graphics device driver. Nvidia has publicly announced to not provide any support for such additional device drivers for their products,[62] although Nvidia has contributed code to the Nouveau driver.[63]
Free and open-source drivers support a large portion (but not all) of the features available in GeForce-branded cards. For example, nouveau driver lacks support for the GPU and memory clock frequency adjustments, and for associated dynamic power management.[64] Also, Nvidia's proprietary drivers consistently perform better than nouveau in various benchmarks.[65] However, and version 3.16 of the Linux kernel mainline, contributions by Nvidia allowed partial support for GPU and memory clock frequency adjustments to be implemented.
The license has common terms against reverse engineering and copying, and it disclaims warranties and liability.[66]
Starting in 2016 the GeForce license says Nvidia "SOFTWARE may access, collect non-personally identifiable information about, update, and configure Customer's system in order to properly optimize such system for use with the SOFTWARE."[66] The privacy notice goes on to say, "We are not able to respond to "Do Not Track" signals set by a browser at this time. We also permit third party online advertising networks and social media companies to collect information... We may combine personal information that we collect about you with the browsing and tracking information collected by these [cookies and beacons] technologies."[67]
The software configures the user's system to optimize its use, and the license says, "NVIDIA will have no responsibility for any damage or loss to such system (including loss of data or access) arising from or relating to (a) any changes to the configuration, application settings, environment variables, registry, drivers, BIOS, or other attributes of the system (or any part of such system) initiated through the SOFTWARE".[66]
GeForce Experience is a program containing several tools including Nvidia ShadowPlay.[68]
Due to a serious security vulnerability before the March 26, 2019 security update, users of GeForce Experience were vulnerable to remote code execution, denial of service, and privilege escalation attacks.[69] When installing new drivers, GeForce Experience may force the system to restart after a 60-second countdown, without giving the user any choice.