Top 31 IT Monitoring software

Last updated: October 12, 2021

IT infrastructure monitoring software allows IT administrators to view their entire IT infrastructure in real time. IT monitoring comprises a broad class of products designed to let analysts determine whether IT equipment is online and performing to expected service levels, and to resolve any detected problems.
1
We make machine data accessible, usable and valuable to everyone—no matter where it comes from. You see servers and devices, apps and logs, traffic and clouds. We see data—everywhere. Splunk offers the leading platform for Operational Intelligence. It enables the curious to look closely at what others ignore—machine data—and find what others never see: insights that can help make your company more productive, profitable, competitive and secure.
2
Monitor & manage your network with unified visibility into fault, performance, availability, traffic, & configurations. Get instant visibility across all system resources that can impact application performance and fix them fast. Find, analyze & resolve your toughest performance issues on SQL Server, Oracle, Sybase, and DB2. Automate 24x7 security monitoring and response to stop threats in their tracks and maintain continuous compliance.
3
New Relic gets you immediate code-level visibility to build faster software, create better products, and delight your customers.
4
Nagios Is The Industry Standard In IT Infrastructure Monitoring. Achieve instant awareness of IT infrastructure problems, so downtime doesn't adversely affect your business. Nagios offers complete monitoring and alerting for servers, switches, applications, and services.
5
An open-source monitoring system with a dimensional data model, flexible query language, efficient time series database and modern alerting approach.
6
Spiceworks bundles network monitoring, helpdesk, UPS power management, PC inventory tools, an online community, and much more. All in one spot. All for free.
7
Zabbix is a mature and effortless enterprise-class open source monitoring solution for network monitoring and application monitoring of millions of metrics.
8
Business Software for Technology Providers. ConnectWise empowers technology solution providers to reach their goals faster with an award-winning business management software suite & much more.
9
Cloud monitoring reinvented. Easy, Automatic, AI-Powered. Automatically monitor dynamic hybrid-cloud environments.
10
PRTG Network Monitor is agentless network monitoring software. It can monitor and classify system conditions like bandwidth usage or uptime and collect statistics from miscellaneous hosts such as switches, routers, servers and other devices and applications.
11
Grafana is the open source analytics & monitoring solution for every database.
12
ManageEngine crafts comprehensive IT management software for all your business needs. We have complete and easy solutions for even your most difficult IT management problems, from keeping your business safe, to ensuring high availability, to making your users happy. We’re bringing IT together so you don’t have to.
13
Datadog is a monitoring service for IT, Operations and Development teams who write and run applications at scale, and want to turn the massive amounts of data produced by their apps, tools and services into actionable insight.
14
ELK is the acronym for three open source projects: Elasticsearch, Logstash, and Kibana. Elasticsearch is a search and analytics engine. Logstash is a server‑side data processing pipeline that ingests data from multiple sources simultaneously, transforms it, and then sends it to a "stash" like Elasticsearch. Kibana lets users visualize data with charts and graphs in Elasticsearch.
15
CloudWatch collects monitoring and operational data in the form of logs, metrics, and events, providing you with a unified view of AWS resources, applications, and services that run on AWS and on-premises servers.
16
Sumo Logic's cloud log management solution processes all your production application logs and server log data, analyzes them in real time and delivers actionable results at a fraction of the cost of an on-premises solution.
17
Kaseya helps MSPs and IT Departments better manage their business with our IT management software suite, IT Complete.
18
Cisco's AppDynamics is an application performance management and IT operations analytics software. Get real-time insights and transform your app performance with our Application Performance Management to drive business outcomes.
19
Solve Operational Problems Faster. Make all of your logs accessible to everyone in one place. No more logging into individual machines. Use searches, filters and graphs to spot trends and narrow down potential root causes. Set up in minutes. No software or agents to install. Works with all standard logging facilities. Owned by SolarWinds.
20
Icinga is an open-source computer system and network monitoring application.
21
Cacti is an open-source, web-based network monitoring and graphing tool designed as a front-end application for the open-source, industry-standard data logging tool RRDtool. Cacti allows a user to poll services at predetermined intervals and graph the resulting data.
22
OpenNMS is a free and open-source enterprise grade network monitoring and network management platform.
23
Logstash is a tool for managing events and logs. You can use it to collect logs, parse them, and store them for later use (for example, for searching). Speaking of searching, Logstash comes with a web interface for searching and drilling into all of your logs.
24
ScienceLogic’s solution is an all-in-one IT monitoring software product, comprehensive and complete in a single, unified platform. Unlike some companies, we do not sell you a la carte, piecemeal selections. To help you understand the many powerful components of our solution, we have broken out descriptions of features and capabilities that other companies call (and sell) as individual products:
25
Server Log Monitoring Tool. We built the log monitoring tool we've always wished for. Scalyr is server log monitoring and analysis built for engineers. Turn chaotic logs and system metrics into actionable data.
26
Graphite is an enterprise-ready monitoring tool that runs equally well on cheap hardware or Cloud infrastructure. Teams use Graphite to track the performance of their websites, applications, business services, and networked servers. It marked the start of a new generation of monitoring tools, making it easier than ever to store, retrieve, share, and visualize time-series data.
27
The Riverbed Network and Application Performance Platform enables organizations to visualize, optimize, accelerate and remediate the performance of any network for any application.
28
Network performance monitoring that goes beyond your network. AppNeta provides real-time, actionable insight from remote offices, over internal networks, and across the internet, so you can quickly get to the bottom of performance issues regardless of where they occur.
29
One platform, unlimited metrics. Monitoring Zen. We get your data flowing to us from anywhere, whether it's through our API, a turn-key solution, or one of our many open source collection agents. Turn your raw data into intuitive metrics with our powerful composite metric definitions. Owned by SolarWinds.
30
OpsRamp is a service that helps IT teams discover, monitor, manage and automate their hybrid environments.
31
Logit centralises and manages your logs and metrics. Automation, analysis and alerting built on ELK.

Latest news about IT Monitoring software


2021. Cribl raises $200M to help enterprises do more with their data



Cribl, the company developing an “open ecosystem of data” for enterprises, has raised a $200 million Series C round. It uses unified data pipelines, called “observability pipelines,” to parse and route any type of data that flows through a corporate IT system. Cribl users can then choose their own analytics tools and storage destinations, such as Splunk, Datadog and Exabeam, without becoming dependent on a vendor. Cribl also lets users choose how they want to store their data, unlike competitors that often lock companies into using only their products. Instead, customers can buy the best products from different categories, and they will all talk to each other through Cribl.


2021. CrowdStrike acquires logging startup Humio for $400M



CrowdStrike, a cloud-native cybersecurity company focused on endpoint protection and threat intelligence for enterprises, has announced plans to acquire U.K.-based log analysis and observability startup Humio in a mostly cash deal worth approximately $400 million. Humio emerged as a notable player in the cloud log-management and observability sphere after being founded out of London in 2016 with the native ability to ingest and analyze both unstructured and semi-structured data. The startup had secured more than $30 million in funding from backers like Accel and Dell, with customers including Microsoft and Bloomberg.




2021. SentinelOne to acquire high-speed logging startup Scalyr for $155M



SentinelOne, a late-stage security startup that helps customers make sense of security data using AI and machine learning, has acquired high-speed logging startup Scalyr for $155M. SentinelOne sorts through oodles of data to help customers understand their security posture, and having a tool that enables engineers to iterate rapidly in the data, and get to the root of the problem, is going to be extremely valuable for them. The company sees Scalyr as a product they can integrate into the platform, but also one which will remain a standalone


2021. Employee IT Experience monitoring provider Nexthink nabs $180M



Nexthink, a late-stage startup that wants to help IT serve its internal clients better, announced a $180 million Series D. Nexthink approaches the IT monitoring problem from a completely different perspective: it gives the IT department a real-time understanding of how employees are experiencing IT. This could involve querying the user base in the same way that HR and marketing survey tools allow companies to check the pulse of employees or customers. Gathering this type of data helps IT understand how employees are using the company’s technology choices. The software is aimed at larger organizations with at least 5,000 employees.


2021. Chronosphere nabs $43M to expand cloud native monitoring tool



Chronosphere, the scalable cloud native monitoring tool, has raised a $43.4 million Series B and announced that its service is generally available. The startup founders created the open-source M3 monitoring project in 2019 and then launched Chronosphere, based on that project. M3 itself is a fairly complex piece of technology to run: it solves a fairly complex problem at large scale, and operating it requires a decent amount of investment, so the first thing Chronosphere is doing is taking care of that management. The company spent most of last year iterating on the product and working with beta customers, and its founders say they benefited from building the commercial service on top of the open-source project.


2020. New Relic acquires Kubernetes observability platform Pixie Labs



New Relic, the publicly traded monitoring and observability platform company, has acquired Kubernetes observability platform Pixie Labs. The Pixie Labs brand and product will remain in place and allow New Relic to extend its platform to the edge. From the outset, the Pixie Labs team designed the service to focus on providing observability for cloud-native workloads running on Kubernetes clusters. And while most similar tools focus on operators and IT teams, Pixie set out to build a tool that developers would want to use. Using eBPF, a relatively new way to extend the Linux kernel, the Pixie platform can collect data right at the source and without the need for an agent.


2020. Splunk acquires network observability service Flowmill



Data platform Splunk continues to make acquisitions as it works to build out its recently launched observability platform. After acquiring Plumbr and Rigor last month, the company today announced that it has acquired Flowmill, a Palo Alto-based network observability startup. Flowmill focuses on helping its users find network performance issues in their cloud infrastructure in real time and measure their traffic by service to help them control cost. Like so many other companies in this space now, Flowmill utilizes eBPF, the Linux kernel’s relatively new capability to run sandboxed code inside it without having to change the kernel or load kernel modules. That makes it ideal for monitoring applications.


2020. Splunk acquires Plumbr and Rigor to build out its observability platform



Data platform Splunk today announced that it has acquired two startups, Plumbr and Rigor, to build out its new Observability Suite, which is also launching today. Plumbr is an application performance monitoring service, while Rigor focuses on digital experience monitoring, using synthetic monitoring and optimization tools to help businesses optimize their end-user experiences. Both of these acquisitions complement the technology and expertise Splunk acquired when it bought SignalFx for over $1 billion last year.


2020. Coralogix lands $25M Series B to rethink log analysis and monitoring



Logging and monitoring is expensive because of the sheer amount of data involved. Companies are therefore forced to pick and choose what they monitor, limiting what they can see. Coralogix wants to change that by offering a more flexible pricing model, and today the company announced a $25 million Series B and a new real-time analytics solution called Streama. It allows companies to pick and choose how they monitor and pay only for the features they use. That means if a particular log is only tangentially important, a customer can set it to low priority and save money, and direct the budget toward more important targets. The company differentiates itself from others out there like Splunk, Datadog and Sumo Logic, as a more modern approach to the problem that simplifies the operations.


2020. New Relic is changing its pricing model to encourage broader monitoring



In the monitoring world, typically when you spin up a new instance, you pay a fee to monitor it. If you are particularly active in any given month, that can result in a hefty bill at the end of the month. That leads to limiting what you choose to monitor, to control costs. New Relic wants to change that and is moving to a model where customers pay by the user instead, with a smaller, less costly data component. The company is also simplifying its product set with the goal of encouraging customers to instrument everything instead of deciding what to monitor and what to leave out to control cost.


2020. Log management platform Logit improves its performance



Popular log management and analysis platform Logit has announced that its improved infrastructure, backed by NVMe (Non-Volatile Memory Express) SSDs, is in place for all Elastic Stacks hosted on the platform. The team at Logit realised that many enterprise-level organisations have yet to reap the benefits of using NVMe for faster, more efficient management and analysis of their log data, so it is pleased to bring this improvement to all new and existing users. NVMe is one of the newest high-performance storage technologies changing the landscape of scalable infrastructure and helping to decrease infrastructure costs. This is part of Logit’s continuous improvement of its platform and is offered free of charge to all users as of the end of June 2020.


2020. Cisco grabs ThousandEyes for network-intelligence technology



Cisco has acquired network intelligence software provider ThousandEyes for $1 billion. ThousandEyes offers a cloud-based software package that analyzes the performance of local and wide-area networks and the internet. Using its worldwide insights into service networks, the company also tracks ISP, cloud and collaboration application performance via a family of intelligent cloud, enterprise and endpoint agents. Cisco said it envisions embedding ThousandEyes technology in a variety of its products, including its AppDynamics application performance, SD-WAN, WebEx and Meraki families, to enhance visibility across the enterprise, internet and the cloud.


2020. Kentik raises $23.5M for its network intelligence platform



Kentik, the company once known as CloudHelix, has raised $23.5 million. The company’s platform allows enterprises to monitor their networks, whether over the internet, inside their own data centers or in public clouds. The company says it has been growing 100% compounded year-over-year since it launched in 2015 and now has customers in 25 countries. These include leading enterprises, SaaS companies, content providers, gaming companies, and cloud and communication service providers. Current customers include the likes of IBM, Zoom, Dropbox, eBay, Cisco and GoDaddy.


2020. Gartner named 15 top application performance monitoring vendors



Application Performance Monitoring (APM) vendors are expanding their ability to predict how application performance impacts digital business, along with increased support for cloud infrastructure, says Gartner. COVID-19 has made it difficult for enterprises to ensure the performance of business-critical apps and the digital experience for their users. To solve these issues, Gartner says enterprises need either digital experience monitoring (DEM) products deployed where employees are located, many of which are highlighted in Gartner’s Magic Quadrant, or cloud-based APM solutions that provide complete visibility and help meet performance goals.


2020. Checkly raises $2.25M for its monitoring and testing platform for DevOps teams



Checkly, a Berlin-based startup that is developing a monitoring and testing platform for DevOps teams, has raised a $2.25M seed round. The company’s SaaS platform allows developers to monitor their API endpoints and web apps — and it obviously alerts you when something goes awry. The transaction monitoring tool makes it easy to regularly test interactions with front-end websites without having to actually write any code. The test software is based on Google’s open-source Puppeteer framework and to build its commercial platform, Checkly also developed Puppeteer Recorder for creating these end-to-end testing scripts in a low-code tool that developers access through a Chrome extension.


2020. Humio raises $20M to advance unlimited logging tool



Humio, a startup that has built a modern unlimited logging solution, announced a $20 million Series B investment today. Humio wants to build a solution that would allow companies to log everything, while reducing the overall cost associated with doing that, a tough problem due to the resource and data volume involved. The company deals with customers who are processing multiple terabytes of data per day. What’s more, because the solution enables companies to log everything, rather than pick and choose what to log, they can ask questions about things they might not know, such as an on-going security incident or a major outage, and trace the answer from the data in the logs as the incident is happening.


2020. OpsRamp raises $37.5M for its hybrid IT operations platform



OpsRamp, a service that helps IT teams discover, monitor, manage and — maybe most importantly — automate their hybrid environments, today announced that it has closed a $37.5 million funding round. At the core of OpsRamp’s services is its AIOps platform. Using machine learning and other techniques, this service aims to help IT teams manage increasingly complex infrastructure deployments, provide intelligent alerting and eventually automate more of their tasks. The company’s overall product portfolio also includes tools for cloud monitoring and incident management.


2019. Splunk acquired cloud AIOps service SignalFx


Splunk, the data processing and analytics company, has acquired SignalFx for a total price of about $1.05 billion. SignalFx provides real-time cloud monitoring solutions, predictive analytics and more. The acquisition will likely make Splunk a far stronger player in the cloud space as it expands its support for cloud-native applications and the modern infrastructures and architectures those rely on. SignalFx will support Splunk's commitment to giving customers one platform that can monitor the entire enterprise application lifecycle.


2019. Sumo Logic raises $110 million to orchestrate cloud apps with AI



Sumo Logic, a cloud-native, machine data analytics platform delivering continuous app intelligence, has raised $110 million in a series G funding round. The infusion follows a breakout year in which Sumo Logic notched over $100 million in revenue and hit the 2,000-customer mark. It’s now valued at over $1 billion, has more than 500 employees, and counts among its client base Airbnb, Pinterest, The Pokémon Co... Sumo Logic, which spans 150 apps and integrations, provides analytics and insights to help clients build, run, and secure apps and cloud infrastructures. The funds will be used to expand Sumo Logic’s engineering, sales, and global operations teams, with an emphasis on extending the platform analytics capabilities of its various services.


2019. Zabbix 4.2 adds built-in support of Prometheus data collection



The Zabbix team announced the release of Zabbix 4.2. The latest release packs in modern monitoring features for data collection and processing, distributed monitoring, real-time problem and anomaly detection, alerting and escalations, visualization and more. The new version can collect data in many different ways (push/pull) from various data sources including JMX, SNMP, WMI, HTTP/HTTPS, RestAPI, XML Soap, SSH, Telnet, agents, scripts and other data sources, with Prometheus being the latest addition to the bunch. The 4.2 release offers an integration with Prometheus exporters using native support for the PromQL language. Moreover, the use of dependent metrics lets Zabbix collect massive amounts of Prometheus metrics in a highly efficient way: it fetches all the data using a single HTTP call and then reuses it for the corresponding dependent metrics.
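
The "single HTTP call, then reuse" idea can be illustrated with a minimal Python sketch. This is not Zabbix's implementation or configuration syntax: the function names `parse_scrape` and `dependent_item` are hypothetical, the sample text stands in for the body of one HTTP response from an exporter, and the parser handles only the simplest lines of the Prometheus text exposition format.

```python
import re

# Hypothetical sample of Prometheus text exposition output. In practice this
# would be the body of ONE HTTP GET against an exporter's metrics endpoint.
SAMPLE_SCRAPE = """\
# HELP node_load1 1m load average.
# TYPE node_load1 gauge
node_load1 0.42
# HELP node_filesystem_avail_bytes Filesystem space available.
# TYPE node_filesystem_avail_bytes gauge
node_filesystem_avail_bytes{mountpoint="/"} 1000
node_filesystem_avail_bytes{mountpoint="/home"} 2500
"""

# Simplified patterns: metric name, optional {labels}, then a numeric value.
LINE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="([^"]*)"')


def parse_scrape(text):
    """Parse one scrape into {(metric_name, sorted_label_pairs): value}."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = LINE_RE.match(line)
        if match is None:
            continue  # ignore lines this simplified parser does not understand
        name, raw_labels, value = match.groups()
        labels = tuple(sorted(LABEL_RE.findall(raw_labels or "")))
        samples[(name, labels)] = float(value)
    return samples


def dependent_item(samples, name, **labels):
    """Read one metric from the already-parsed master data (no new request)."""
    return samples[(name, tuple(sorted(labels.items())))]


# One "master" parse feeds any number of dependent lookups.
master = parse_scrape(SAMPLE_SCRAPE)
load = dependent_item(master, "node_load1")
root_free = dependent_item(master, "node_filesystem_avail_bytes", mountpoint="/")
home_free = dependent_item(master, "node_filesystem_avail_bytes", mountpoint="/home")
```

The point of the design is that the cost of the network round trip is paid once per scrape, while each additional metric costs only a dictionary lookup.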


2018. Google Cloud adds new applications performance monitoring tool



Google added a key ingredient for developers building applications on the Google Cloud Platform: a suite of application performance management tools called Stackdriver APM. It is designed for developers to track issues in the applications they have built instead of passing that responsibility on to operations. The thinking is that the developers who built the applications and are closest to the code are therefore best suited to understand the signals coming from it. Stackdriver APM is made up of three main tools: Profiler, Trace and Debugger. Trace and Debugger have already been available, but by putting them together with Profiler, the three tools work together to identify, track and repair code issues.


2018. SolarWinds acquires log-monitoring service Loggly



IT management company SolarWinds has acquired the cloud-based log-monitoring and analytics service Loggly. According to its marketing materials, about a third of the Fortune 500 use the company’s services, including the likes of Lenovo, Pizza Hut and Dell. SolarWinds argues that this acquisition will expand the company’s engineering and analytics expertise and that it will push the company’s overall strategy of building a full-stack monitoring platform. SolarWinds, which has acquired its fair share of companies (including Pingdom) over the years, will keep the Loggly brand and product alive.


2017. Splunk expands machine learning capabilities across platform



Cloud monitoring provider Splunk is enhancing machine learning capabilities to make it easier to surface the data that’s most critical. The Splunk Machine Learning Toolkit has several new key features designed for the DIYers. First of all, there is a new data cleaning tool to get the data ready for the model. Next, there are machine learning APIs to import open source and proprietary algorithms and apply them in Splunk. Finally, there is a machine learning management component to integrate user permissions directly from Splunk into custom machine learning applications. For those who would prefer Splunk to do some of the work for them, there are also new goodies including Splunk ITSI 3.0, which uses machine learning to help identify issues and prioritize them based on how critical a given operation is to the business.


2017. Cisco acquired application performance management software AppDynamics



Cisco acquires AppDynamics for a whopping $3.7 billion. AppDynamics helps its customers monitor the performance of business applications, which gives them a deeper look into the inner workings of those applications. With a better overview, companies can more effectively spot chokepoints and performance issues that may be holding back transactions and other kinds of action. The idea is to catch issues before they become major problems and huge headaches for those companies. For Cisco, that means it gets yet another touchpoint with companies and can offer them a more complete stack to help them run their businesses.


2016. Splunk unveiled 300 machine learning algorithms for Operational Intelligence



Operational Intelligence platform provider Splunk has expanded in a significant way the early version of machine learning in its platform to deliver new services and capabilities. It has added machine learning to the core of its platform with a machine learning toolkit that can be installed as a free app on top of the Splunk Enterprise platform. This toolkit provides 300 algorithms for machine learning, 27 of which are pre-packaged out of the box and focus on such categories as clustering, recommendations, regression, classification and text analytics. Splunk has also enhanced the machine learning in its IT Service Intelligence (ITSI) platform it introduced this time last year.


2015. AppDynamics allows tracking business transactions from user to server



AppDynamics, a company that helps monitor apps and websites in order to prevent outages (or at least understand why they happened and recover as quickly as possible) announced a major update. The latest version helps identify those big problems that take down a website, but also see much smaller ones that can have a negative impact on the collective or individual experience while using an app or website. In particular, it’s supposed to help customers surface patterns like multiple people having an issue with the shopping cart on an eCommerce site, or following an individual customer through what the company calls a ‘digital journey’, as he or she uses a mobile app or visits the company website. The goal is to help identify performance problems wherever they occur along the way.


2015. Application performance management provider AppDynamics raises $158M



San Francisco-based company AppDynamics has closed on $158 million in a round led by General Atlantic and Altimeter Capital. AppDynamics makes software to monitor the performance of business applications, competing with some traditional firms like IBM, as well as younger outfits like New Relic, which went public last December and has seen relatively steady stock performance since. (New Relic, which raised $214 million in venture funding, has a current market cap of $1.8 billion.) AppDynamics had previously raised roughly $206 million in debt and equity, including a $120 million round ($70 million of equity and $50 million of debt) that closed in July of last year.


2015. Splunk acquired machine learning startup Caspida



Cloud monitoring provider Splunk has purchased Caspida, the startup that uses machine learning techniques to help identify cybersecurity threats from inside and outside the company. Splunk helps companies deal with the onslaught of machine data coming from IT systems, using data science techniques and automation to make sense of it. The company has a security product called Splunk App For Enterprise Security. With this purchase, Splunk is adding a new tool to its security arsenal to beef up the ability to locate threats using the machine learning techniques that Caspida has developed. It can apply data science to look more closely at the usage of every person using the system, even if they appear to be a valid user with proper credentials. Splunk’s general approach is a solution that is data science driven, provides automated detection and uses machine learning to get smarter over time.


2015. Microsoft acquires application performance management service BlueStripe



Microsoft has acquired BlueStripe, an application performance management service that helps enterprises monitor and troubleshoot applications that have been deployed across different operating systems, data centers and cloud environments. This makes it easier for IT to help maintain service level objectives because the admins will be able to quickly figure out where the issues are and why they are happening. Microsoft says it will integrate the service into tools like System Center and the company’s Operations Management Suite (OMS). Microsoft will stop selling BlueStripe soon, as it starts to incorporate the service into its own products, but the company says that it will still support BlueStripe’s existing customers until then.


2015. Server log monitoring tool Scalyr raises $2.1M



Scalyr, a log-monitoring service that gives developers more insight into how their applications are performing, has raised a $2.1 million seed round. The idea behind Scalyr is to take all data from server logs, as well as various metrics, error reports and other performance data, and run it through a single tool that yields actionable data. Services like Splunk and Loggly offer similar log management services, but those tools make it easy to see anecdotes while making it hard to roll them up into an overview. Scalyr also doesn’t see New Relic as a competitor, because that company’s focus tends to be on performance, while Scalyr focuses more on errors. Log files can quickly grow very large, and analyzing them takes quite a bit of compute power. The team built a new data management engine for its service to handle all of this data.