Data center management explained

Data center management[1] is the collection of tasks performed by those responsible for managing ongoing operation of a data center.[2] This includes Business service management and planning for the future.

Historically, "data center management" was seen as something performed by employees, with the help of tools collectively called data center-infrastructure management (DCIM) tools.[3]

Both for in-house operation and outsourcing, service-level agreements must be managed to ensure data-availability.[4]

Competition

See main article: Coopetition. Data center management is a growing major topic for a growing list of large companies who both compete and cooperate, including: Dell,[5] Google,[6] HP,[7] IBM,[6] Intel[7] and Yahoo.[7]

Hardware/software vendors who are willing to live with coopetition[8] [9] are working on projects such as "The Distributed Management Task Force" (DMTF)[10] with a goal of learning to "more effectively manage mixed Linux, Windows and cloud environments."

With the DMTF a decade old, the list of companies is growing, and also includes companies much smaller than IBM, Microsoft, et al.[11]

Focus

Among the topics currently being explored are:[12] scalability, securing data center networks, disaster recovery, government restrictions.[13]

Another major area is the cost of downtime regarding customer dissatisfaction & business loss,[14] and also the "astonishing" yet hidden cost and effect regarding personnel & productivity.[15]

Business-service management

Business-service management (BSM) treats IT as part of the larger enterprise strategy,[16] and helps fill the gap between business and IT.[17]

IBM notes that major problems often happen in the grey areas, particularly due to errors in the interfaces, and focuses on critical failures. Sufficient redundancy should allow failures in non-critical areas to protect the business from being affected.[17] BSM, which is positioned above IT Service Management (ITSM), promotes a customer-centric and business-focused approach to service management, aligning business objectives with IT or ICT from strategy through to operations. Tools that help BSM include a modeling language,[18] and a common dashboard, which together allow data center personnel to see problems before business customers do.[19]

Newer developments

Remote data center management[20] allows offsite experts to watch for situations needing their timely intervention at a lower cost than having such staff be onsite 24/7/365.

While some requirements for on-site hardware have been reduced,[21] spending in other hardware areas such as UPS may have to increase.[22]

Data center asset management

Data center asset management (also referred to as inventory management)[23] is the set of business practices that join financial, contractual and inventory functions to support life cycle management and strategic decision making for the IT environment. Assets include all elements of software and hardware that are found in the business environment.[24]

IT asset management generally uses automation to manage the discovery of assets so inventory can be compared to license entitlements. Full business management of IT assets requires a repository of multiple types of information about the asset, as well as integration with other systems such as supply chain, help desk, procurement and HR systems and ITSM.

Hardware asset managementHardware asset management entails the management of the physical components of computers and computer networks, from acquisition through disposal.[25] Common business practices include request and approval process, procurement management, life cycle management, redeployment and disposal management. A key component is capturing the financial information about the hardware life cycle which aids the organization in making business decisions based on meaningful and measurable financial objectives.
Software asset managementSoftware Asset Management is a similar process, focusing on software assets, including licenses. Standards for this aspect of data center management are part of ISO/IEC 19770.

Data center infrastructure management

Data center-infrastructure management (DCIM) is the integration[26] of information technology (IT) and facility management disciplines[27] to centralize monitoring, management and intelligent capacity planning of a data center's critical systems. Achieved through the implementation of specialized software, hardware and sensors, DCIM enables common, real-time monitoring and management platform for all interdependent systems across IT and facility infrastructures.

DCIM products can help data center managers identify and eliminate sources of risk[28] and improve availability of critical IT systems. They can also be used to identify interdependencies between facility and IT infrastructures to alert the facility manager to gaps in system redundancy, and provide dynamic, holistic benchmarks on power consumption and efficiency to measure the effectiveness of "green IT" initiatives.[29] [30]

Important data center metrics include those regarding energy efficiency and use of servers, storage, and staff. In too many cases, disk capacity is vastly underused and servers run at 20% use or less.[31] More effective automation tools can also improve the number of servers or virtual machines that a single admin can handle.

DCIM providers are increasingly linking with computational fluid dynamics providers to predict complex airflow patterns in the data center. The CFD component is necessary to quantify the impact of planned future changes on cooling resilience, capacity and efficiency.[32]

Operations

Information technology operations, or IT operations (ITOps), are the set of all processes and services managed by IT staff[33] for use by internal or external clients. The term refers to the application of operations management to the technology used to run the business.

Operations work can include responding to support tickets generated for maintenance work or customer issues.[34] Some operations teams provide on-call support, responding to incidents outside of normal business hours.

As lights out[35] operations increased, less of the staff are located near corporate headquarters.[36] [37] Gartner defines IT operations as "the people and management processes associated with IT service management to deliver the right set of services at the right quality and at competitive costs for customers."[38]

Technical support

Technical support (often shortened to tech support) refers to services. Within a corporation, these are also known as help desks[39] often arrange their technical support structure as a three-tier (plus two) system:[40]

The extra tiers are:[40]

Access to varying levels of support for products and services to in-house employees and corporate customers, providing informationand troubleshooting[41] is via various channels such as toll-free numbers,[42] websites, instant messaging, or email.

Help desk professionalism

An ITIL-compliant help desk is usually a part of a bigger service desk unit, which is part of ITSM.[43]

As the incoming phone calls are random in nature, help desk agent schedules are often maintained using an Erlang C calculation. Companies with custom application software may also have an applications team who are responsible for the development of in-house software. The help desk may assign to the applications team such problems as finding software bugs. Requests for new features or information about the capabilities of in-house software that come through the help desk are also assigned to applications groups. The help desk staff and supporting IT staff may not all work from the same location. With remote access applications, technicians are able to solve many help desk issues from another work location or their home office. While there is still a need for on-site support to effectively collaborate on some issues, remote support provides greater flexibility.

Some companies and organizations provide discussion boards for users of their products to interact; such forums allow companies to reduce their support costs[44] without losing the benefit of customer feedback.

Some fee-based service companies charge for premium technical support services.[45]

Outsourcing technical support

Many organizations relocated their technical support departments or call centers to countries or regions with lower costs. Dell was amongst the first companies to outsource their technical support and customer service departments to India in 2001, but then reshored.[46] There has also been a growth in companies specializing in providing technical support to other organizations. These are often referred to as MSPs (Managed Service Providers).[47]

For businesses needing to provide technical support, outsourcing allows them to maintain a high availability of service. Such need may result from peaks in call volumes during the day, periods of high activity due to introduction of new products or maintenance service packs, or the requirement to provide customers with a high level of service at a low cost to the business. It allows businesses to use specialized personnel whose technical knowledge base and experience may exceed the scope of the business, thus providing a higher level of technical support to their employees.

Scams

See main article: Technical support scam. A common scam typically involves a cold caller claiming to be from a technical support department of a company like Microsoft. Such cold calls are often made from call centers based in India to users in English-speaking countries, although increasingly these scams operate within the same country. The scammer will instruct the user to download a remote desktop program and once connected, use social engineering techniques that typically involve Windows components to persuade the victim that they need to pay for the computer to be fixed and then proceeds to steal money from the victim's credit card.[48]

Preventive maintenance

Preventive maintenance (or preventative[49] maintenance (PM)) is ongoing scheduled[50] inspection[51] intended to detect and correct incipient failures either before they occur or before they develop into major problems such as downtime.

Managing the capacity of a data center

With the increasing use of "the cloud" and what has been called "the Era of Infinite Capacity",[52] there is still a need for professional Data Center Capacity Planners.[53]

There is a need to know what will be needed, and when.[54] Data must continually be collected regarding usage of power/energy, computing power, data storage and networking/telecommunications. Plans must include awareness of cooling and space requirements.

Sometimes analysis of this data, and comparison to industry norms, can be outsourced.[54] The balance for the need to focus more on data collection[55] or analysis depends on current use levels: prior to 50%, the focus can stay more on data collection. Beyond 75%, the focus must shift to analysis, in preparation for upgrades, replacements and expansions. The data center is a resource in its own right.[56]

Top data centers and service providers

According to Cloudscene's Leaderboard for Q1 2018, data center operators are ranked "based on both data center density (total operated data centers)", as well as "the number of listed service providers in the facility". Cloud service providers are ranked based on "connectivity (the total number of PoPs) for the region." Chosen from a pool of more than 6,000 providers, the rankings are as follows:[57]

Q1, 2018 Top Data Center Operators Worldwide
RankNorth AmericaEMEAOceaniaAsia
1EquinixEquinixEquinixEquinix
2Digital RealtyInterxionNextDCGlobal Switch
3CoreSiteTelehouseVocus CommunicationsNTT Communications
4ZayoDigital RealtyGlobal SwitchGPX Global Systems
5Level 3 CommunicationsGlobal SwitchYourDCST Telemedia Global Data Centres
6CologixLevel 3 CommunicationsMacquarie TelecomNetmagic Solutions
7CyxteraitconiciseekAIMS
8TierPointColt Technology ServicesInteractiveDigital Realty
9Netrality PropertiesNikhefDatacomTelstra
10QTS Realty TrustOrange Business ServicesData Centre LimitedOneAsia Network
Q1, 2018 Top Service Providers Worldwide
RankNorth AmericaEMEAOceaniaAsia
1ZayoColt Technology ServicesTelstraColt Technology Services
2Level 3 CommunicationsEuNetworksVocus CommunicationsPCCW Solutions
3VerizonCogent CommunicationsPIPE NetworksTata Communications
4Crown CastleZayoOptusPCCW Global
5AT&TLevel 3 CommunicationsNextGen GroupTelstra
6Cogent CommunicationsBTAAPTNTT Communications
7CenturyLinkInterouteMegaportSuperloop
8XO CommunicationsVerizonSuperloopZenlayer
9ComcastOrange Business ServicesZencross ConnectChina Telecom
10TW TelecomNL-IXUecommSingtel

Notes and References

  1. News: The New York Times . What Startups in Amazon's Ecosystem Should Learn From VMware. May 3, 2009. GigaOm. their existing data center management .... live . https://web.archive.org/web/20230120152908/https://archive.nytimes.com/www.nytimes.com/external/gigaom/2009/05/03/03gigaom-what-startups-in-amazons-ecosystem-should-learn-f-12208.html . Jan 20, 2023 .
  2. Web site: What is Data Center Management? . live . https://web.archive.org/web/20211214153938/https://www.sunbirddcim.com/what-is-data-center-management . Dec 14, 2021 . Sunbird DCIM.
  3. Web site: Network World. Data-center management: What does DMaaS deliver that DCIM doesn't?. Ann Bednarz . May 24, 2018. live . https://web.archive.org/web/20231209114050/https://www.networkworld.com/article/965606/data-center-management-what-does-dmaas-deliver-that-dcim-doesnt.html . December 9, 2023 .
  4. . Cloud Computing and Service Level Agreements (SLAs). Christine . Taylor. April 17, 2017. live . https://web.archive.org/web/20221013020946/https://www.datamation.com/cloud/cloud-computing-and-service-level-agreements-slas/ . Oct 13, 2022 .
  5. Web site: . August 21, 2012. Dell Makes Moves to Survive in Cloud-Centric World . Quentin . Hardy . live . https://web.archive.org/web/20230116110501/https://archive.nytimes.com/bits.blogs.nytimes.com/2012/08/21/dell-makes-moves-to-survive/ . Jan 16, 2023 .
  6. News: The New York Times . October 8, 2007. Google and I.B.M. Join in 'Cloud Computing' Research . subscription. Steve . Lohr . live . https://web.archive.org/web/20231024213747/https://www.nytimes.com/2007/10/08/technology/08cloud.html . Oct 24, 2023 .
  7. News: The New York Times . Yahoo, Intel and HP Form Cloud Computing Labs . July 29, 2008. Juan Carlos . Perez . IDG News Service. live . https://web.archive.org/web/20230116110457/https://archive.nytimes.com/www.nytimes.com/idg/IDG_852573C4006938800025749500529A63.html?ref=technology . Jan 16, 2023 .
  8. News: . Coinages That Last . August 9, 2003. Many buzzwords, like coopetition and thought-leading,. subscription. live . https://web.archive.org/web/20230117133514/https://www.nytimes.com/2003/08/09/opinion/l-coinages-that-last-834360.html . Jan 17, 2023 .
  9. Web site: . November 7, 2005. The Online Travel Landscape Is Getting Crowded. ... which Internet analysts love to call "coopetition.". subscription. Bob . Tedeschi . live . https://web.archive.org/web/20230117133518/https://www.nytimes.com/2005/11/07/technology/the-online-travel-landscape-is-getting-crowded.html . Jan 17, 2023 .
  10. News: The New York Times . October 27, 2008. Meeting virtualization management challenges. John . Fontana . IDG. live . https://web.archive.org/web/20230117133518/https://archive.nytimes.com/www.nytimes.com/external/idg/2008/10/27/27idg-Meeting-virtual.html . Jan 17, 2023 .
  11. The Times article mentions "a crop of next-tier vendors, start-ups and open source players."
  12. The Data Center Journal. Data Center Management . October 28, 2018.
  13. Computer Weekly. Power struggle . October 26, 2018 . Matt Hancock. a row is brewing over an EU plan to curb datacentre energy use.
  14. News: Flights Cancelled for more than 75,000 passengers. Reuters. May 29, 2017.
  15. Web site: The astonishing hidden and personal costs of IT downtime (and how predictive analytics might help) . David Gewirtz . . May 30, 2017.
  16. Web site: What is business service management (BSM)? .
  17. Web site: Jenko Gaviglia . Business Service Management . https://web.archive.org/web/20181225130036/http://www-05.ibm.com/be/pdf/en/events/com_inf_solution_day/Skillteam_Common.pdf . 2018-12-25 . IBM.com.
  18. Book: 10.1007/978-3-642-22760-8_16 . The Business Service Representation Language: A Preliminary Report . Towards a Service-Based Internet. ServiceWave 2010 Workshops . Lecture Notes in Computer Science . 2011 . Ghose . A. K. . Lê . L. S. . Hoesch-Klohe . K. . Morrison . E. . 6569 . 145–152 . 978-3-642-22759-2 . https://ro.uow.edu.au/infopapers/1518 .
  19. News: Targeting hybrid IT environments. . . Bednarz . June 2010.
  20. Web site: Remote Data Center Management.
  21. Web site: Hard Times Could Create a Tech Boom . Quentin Hardy . November 17, 2012.
  22. Web site: In 2014, Proactive UPS Maintenance is Essential for all Data Center Managers. UPS-redundant configurations, providing backups for backups that have their own backups..
  23. or IT asset management (ITAM)
  24. Web site: . IT Asset Management (ITAM) . May 18, 2013 . January 15, 2019.
  25. News: . It's Easy, and Expensive, to Forget About Old Equipment . Brent Bowers . March 13, 2008.
  26. ECmag. Tracking All the Data: Data Center Infrastructure Management .... (DCIM) software enables ... integration ....
  27. Web site: Data Center Infrastructure Management – Data Center Handbook.
  28. . Measure and manage the risk inherent in your IT infrastructure . August 13, 2010.
  29. . September 9, 2018. Green Computing And Storage. Tom Coughlin .
  30. Web site: The New York Times. Mission: Green Computing" by Supermicro Introduces Total Cost. August 20, 2018. December 3, 2018. November 30, 2018. https://web.archive.org/web/20181130113121/https://markets.on.nytimes.com/research/stocks/news/press_release.asp?docTag=201808201100PR_NEWS_USPRX____SF82236%26feedID=600%26press_symbol=7084606. dead.
  31. Web site: Measuring Data Center Efficiency: Easier Said Than Done . Dell.com . June 25, 2012 . dead . https://web.archive.org/web/20101027083349/http://content.dell.com/us/en/enterprise/d/large-business/measure-data-center-efficiency.aspx . October 27, 2010 .
  32. Web site: Computational-Fluid-Dynamic (CFD) Analysis | Gartner IT Glossary. gartner.com. August 27, 2014.
  33. Web site: Computer Operators.
  34. Book: Site Reliability Engineering: How Google Runs Production Systems . 2016 . O'Reilly . 978-1-491-92912-4.
  35. . Premier 100 Q&A: HP's CIO sees 'lights-out' data centers . March 6, 2006.
  36. News: The New York Times. From Manhattan to Montvale . April 20, 1986.
  37. News: . Dell Sees Double With Data Center in a Container . Ashlee Vance . Ashlee Vance . December 8, 2008.
  38. Web site: IT Operations – Gartner IT Glossary . February 8, 2012. gartner.com.
  39. News: . An Update for the Corporate Help Desk . Quentin Hardy . October 30, 2012.
  40. Web site: IT Support Levels Clearly Explained: L1, L2, L3, and More . Joe Hertvik . July 7, 2016.
  41. Web site: IT Help Desks Not Just For Large Enterprises . May 12, 2019 . July 18, 2012 . https://web.archive.org/web/20120718030404/http://www.informationweek.com/news/smb/ebusiness/230800044 . dead .
  42. Web site: Students – Information technology – Calvin College . Calvin College . March 23, 2018.
  43. Web site: Help Desk vs Service Desk vs ITSM.
  44. Web site: How to Use Online Forums . . April 22, 2010.
  45. Web site: Technical support for the neighbours . March 6, 2008 . BBC News . March 28, 2005.
  46. https://www.nbcnews.com/id/wbna4853511 Dell moves outsourced jobs back to U.S. shores
  47. Web site: Call Centre Trends . May 2, 2008 . Berkley . Susan . Maggie Klenke . The Great Voice Company .
  48. News: Arthur. Charles. Virus phone scam being run from call centres in India. March 31, 2014. Guardian. July 18, 2012.
  49. News: . Wellness . Complaints about preventative go back to the late 18th century ... ("Oxford English Dictionary dates preventive to 1626 and preventative to 1655) ..preventive has won" . April 18, 2010. Ben Zimmer.
  50. Web site: What is Preventive Maintenance? . MicroMain.com.
  51. Web site: BusinessDictionary.com. What is preventive maintenance?. November 18, 2018. November 19, 2018. https://web.archive.org/web/20181119051648/http://www.businessdictionary.com/definition/preventive-maintenance.html. dead.
  52. Web site: Capacity Planning in the Era of Infinite Capacity. Samir Mehra . September 11, 2018.
  53. Web site: indeed.com (job search). Data Center Capacity Planner Jobs, Employment. 293 Data Center Capacity Planner jobs available on Indeed.com.
  54. News: . Room to grow: Tips for data center capacity planning . Thomas A. Limoncelli . Strata R. Chalup . Christina J. Hogan.
  55. since this consumes both computing and storage resources
  56. Book: 10.1109/ICAC.2007.28. On the use of fuzzy modeling in virtualized data center management. J Xu . M Zhao . J Fortes . R Carpenter. Fourth International Conference on Autonomic Computing (ICAC'07). 2007. 25. 978-0-7695-2779-6. 16153431.
  57. Web site: Cloudscene Rankings: Top Data Centers & Service Providers Worldwide. Cloudscene. November 9, 2018. October 26, 2018. https://web.archive.org/web/20181026064504/https://cloudscene.com/top10. dead.