This article discusses APM and observability while highlighting free observability solutions and tools. I’ve also included several quotes collected from observability companies who offer free-level access to their platforms and composed a list of 40 noteworthy paid and open-source observability solutions.
Lastly, I hope to urge observability providers to extend free and open access to their platforms and software, beyond free-trials and without requiring any purchase or commitment. Free access, free tools and open-source observability software allow potential customers to easily compare and narrow down what works best for them.
Organizations that adjust to allow open-source, free-engagement and free-tiers of service, will better position their platforms in 2021 and beyond to emerge as observability leaders.
The observability market is fast-evolving and becoming more complex in order to meet the requirements of increasingly complicated systems and software. It is essential then that those new to observability and all that it encompasses, can explore, learn, and experiment with the various technologies, tools, and providers available to choose from.
This approach empowers customers to match with the most suitable platform(s) for application performance metrics, traces, logs, and so on. In light of recent events, free and open also adds the potential of enhanced software and infrastructure security.
Last year, I asked 12 APM companies about the future of APM, and more recently, 20 leading companies shared how COVID-19 is accelerating the future of APM. These articles covered the expansion of APM into observability and the increasing importance of both practices. However, in true 2020 fashion, many top APM organizations have almost overnight augmented APM into observability.
What is APM?
Application Performance Monitoring (or APM) refers to monitoring and managing the performance and availability of software applications. This is typically accomplished by using APM software to monitor applications and the underlying infrastructure. With APM, we can gather performance metrics data, such as error rates, slow requests, system resource usage, response times, and so on, to aid in application performance optimization. APM is essentially a subset of observability.
Observability is being defined right now!
Observability helps developers find what’s slow, what’s broken, and what can be done to improve the performance of complex multi-layered architectures.
Now that infrastructure, software and applications are becoming exponentially more complex, our ability to see and especially predict what’s going to break has never been more important. The focus on observability has exploded this year. In fact, most of the companies in this article are already in the process of going beyond APM, to observability.
For example, in June 2020, I reached out to New Relic’s former Senior Vice President of Marketing, Tristan Bishop, to discuss the company’s monitoring. Part of his response was “…we don’t consider ourselves an APM company”. See How New Relic went from being an APM company to an observability platform.
In this vein, I’d like to draw your attention to a massive ongoing battle between competitors in the observability space. Right now, organizations are still trying to define observability, or rather, what aspects falls under its umbrella. In the past few months, just as one company publishes its observability definition, another will then publish theirs.
So, what is observability? Well, rather than adding to the many different published definitions, I’ll point you to a few articles and pages recently published (all within the past six months) by some of the organizations currently defining the future of observability:
- Observability: A complete overview for 2020 – Lightstep (Published: April 2020)
- What is observability? – New Relic (Published: July 14th, 2020)
- Three Pillars of Observability – Datadog. (Published: August 2020)
- What is observability? – Epsagon (Published: August 2020)
- What is observability? – Dynatrace (Published: August 28th, 2020)
- What is observability? Discerning myths from facts – Site24x7 (Published: September 2020)
- Monitoring vs Observability: What’s the Difference? – LogicMonitor (Published: September 22nd, 2020)
- A New Platform for Multicloud Observability and Management – Oracle (Launched: October 6th, 2020)
- Going Beyond the Three Pillars of Observability – Broadcom (Published: October 22nd, 2020)
- Observability Matters: Why IBM’s Acquisition of Instana Is a Game Changer for AIOps – Instana / IBM (Announced November 2020)
14 Companies offering Free-Tier plans and other Free Observability Solutions.
Those searching for better observability should have the opportunity to compare and make use of the full range of solutions, whether free or paid. For example, many of the companies who responded below offer free-tier plans which allow access to most, or in some cases all, platform functionalities and features.
Here’s how some of these observability and monitoring organizations responded when asked to… share with readers what they can make use of and experience free-of-cost.
(The following quotes were collected between November 9th and December 1st 2020)
“AWS infrastructure monitoring is effortless with Instrumental. After connecting to our industry-leading AWS Cloudwatch integration, you’ll instantly have dashboards, graphs, and alerts without any additional setup or configuration. Every graph and alert includes a clear and concise description of what is being monitored and why it’s important.
Our free AWS monitoring plan is ideal for single developers and small businesses but is also useful for evaluating Instrumental and testing the setup process in larger organizations. It offers the same features and functionality as our paid plans but limits the number of metrics and length of data visibility. The free plan is limited to 500 metrics and three hours of data visibility, where our paid plans have unlimited metrics and 13 months of data visibility.
With a free plan to get started, and transparent, flexible pricing as you grow, Instrumental is the ideal AWS infrastructure monitoring platform for any stage company.”
– – – –
Chief Strategy Officer
Datadog is a SaaS-based monitoring and security platform that includes infrastructure monitoring, application performance monitoring, user experience monitoring, and more. Our intent with our free offerings is to provide an introduction to the platform for students, early startups, and personal use. To that end, we offer free Pro accounts for students, and startups can join our Datadog for Startups program for free access as well.
On top of that, anyone can start a free two-week trial of the entire platform, or use the free tier for infrastructure monitoring on five or fewer hosts indefinitely. The free tier offers 15-second metric granularity, 400+ out-of-the-box integrations for popular technologies, and no caps on data volume or users. While the free tier can stand on its own, a paid account will allow you to send custom metrics, retain metrics for 15 months, and jump from a metric spike to related logs or traces while staying in the platform.
If you’re interested in learning more about monitoring best practices, we offer courses free of charge at learn.datadoghq.com, which cover topics like Kubernetes monitoring, how to build better dashboards, and how to work with SLOs. We also regularly post in-depth articles on how to monitor particular technologies, regardless of whether you’re using Datadog or not, on our blog, The Monitor. Recent topics include key metrics for VMWare vSphere, Istio, Lambda, and OpenShift.
If you’d rather get your hands dirty with development, you can also check out our GitHub page, which includes published open-source projects like our agent and DogStatsD. If you do end up working on your own integration or a new version of the agent, you can contact email@example.com to list it on our Marketplace and sell it to other Datadog users with no program or initiation fees.
– – – –
Director of Product Management
Epsagon enables teams to instantly simplify, visualize, and understand what’s happening within complex microservice architectures. With our comprehensive lightweight auto-instrumentation, users are able to eliminate the gaps in data and manual work associated with other APM solutions, providing significant reductions in issue detection, troubleshooting, and resolution times. Epsagon aggregates, unifies, analyzes, and correlates data from all the third party tools you love, delivering a single pane of glass for understanding containers, Kubernetes, serverless, and more. Developers now know when something is wrong and can immediately trace the issues to root cause before they affect production. See how you can increase development efficiency and reduce application downtime with Epsagon through these free plans:
Free Trial: A 14-day full-functionality trial of Epsagon, no credit card required. Including up to 10M traces per month, unlimited users, and full support via chat or email.
Free Tier: Up to 1K traces per month, metrics and alerting functionalities, unlimited users, and full support via chat or email.
“Organizations are increasingly adopting a complex ecosystem of microservices and containers spread across distributed environments, to quickly launch applications and meet market demands. As these systems increase in complexity, the challenges of understanding, debugging, and optimizing them are also growing exponentially.
DevOps and IT teams need to look for full-stack observability solutions that provide granular insights into the behaviour of systems along with rich context, to ensure that applications are equipped to maintain service delivery and avoid performance bottlenecks. They need to proactively monitor, debug, and tune application and infrastructure performance, and in turn, deliver better end-user experiences. With advanced observability tooling in place, businesses can be confident of proactively noticing and catching bugs after code is deployed to production, even if they follow a mature DevOps process that involves frequent code changes.
Organizations can easily get started with observability in a small dev or test environment using the free version of ManageEngine Applications Manager. They can then easily transition to a premium package once they are ready to expand and move into production.”
Sylvester Jayan P R
DevOps teams across verticals need tools that will help them in troubleshooting the various issues that they may face on a day to day basis. These issues can occur in different areas such as network infrastructure, applications, web design etc.
At Site24x7, we have developed over 60 free tools which will aid the DevOps teams in troubleshooting issues effectively.
Some of the free tools we provide for efficient troubleshooting are:
- Domain tools that check the availability and performance of websites, DNS, TCP service etc. from over 110 locations worldwide.
- Website security tools that help to check whether any of your assets are blacklisted.
- AWS Designer tool that helps in designing and visualizing your AWS infrastructure.
- Developer tools that help in compressing JS, formatting JSON/XML etc.
For our full list of free tools, click here.
Director of Product Management
“SolarWinds Believes observability is about understanding the complete state of your application and infrastructure environment. Developers and support personnel must have comprehensive and consistent infrastructure and application performance metrics, visibility into application performance down to the line of code, the ability to trace service relationships, and a comprehensive view into related logs.
With their integrated APM experience, SolarWinds® AppOptics™ and Loggly® deliver affordable, easy to use, and comprehensive observability across custom applications and their supporting infrastructure. The ability to immediately view specific log entries associated with given traces provides full-stack observability for developers to understand everything affecting the applications they’re developing—helping ensure optimal performance when applications “go live.” The free plans are designed so developers can test the benefits of complete observability before committing to a solution.
SolarWinds AppOptics Dev Edition is a free, full-function APM tool for testing and troubleshooting application performance before moving into production, with the same capabilities as SolarWinds AppOptics. Intended for pre-production use by application developers, AppOptics Dev Edition delivers comprehensive observability into infrastructure and application metrics, code profiling, and tracing across services, hosts, and containers for proactive application performance troubleshooting pre-production. All functionality of paid AppOptics plans is available, with limits on the quantity of metrics, hosts, and containers visible.
Sharing the same user experience as AppOptics, SolarWinds Loggly Lite is a free, comprehensive log aggregation, management, and search solution delivering application developers powerful log aggregation, analysis, and search tightly integrated with application and infrastructure performance metrics and traces. Loggly Lite is limited in terms of production features (e.g., no alerting, log archival, etc.) and duration of log retention (7 days) but is targeted at developers and is easily upgradeable to a paid plan with additional capabilities. Initial 30-day trials are full-function and automatically revert to Loggly Lite if a paid plan isn’t purchased.
In addition to free observability tools, SolarWinds offers several additional free tools for IT professionals.”
You can only speak about observability if all collected information is made available to all members of an organization. However, many organizations suffer from a sprawl of monitoring tools working in isolation. This bears the risk that problem root cause analyses or performance optimization are being slowed down by people working in silos and tools not bridging these silos. The monitoring information is there, somewhere, but the way the data is collected and processed is so inefficient that organizations are getting far away from being agile.
You need to accept that developers, sysadmins, and other actors in the DevOps world use different tools designed for their individual needs. You can only make observability work if you are able to integrate the tools of choice efficiently with each other. This means bi-directional information-sharing without having tools and people doing the same jobs over and over again.
Any team within an organization must open itself to collaboration. IT operations can make the first step. With Checkmk, they have an easy entry into the process, because Checkmk integrates with Prometheus, but is also able to provide data on other infrastructure assets, containers, and applications. Sysadmins can bring in their experience, create precise alarms, and provide value to developers by identifying issues that would otherwise have been missed.
You can start today with the Checkmk Free Edition: https://checkmk.com/
Provide a World-Class End User Experience
“As a trusted SaaS monitoring and performance testing platform for organizations around the world, Dotcom-Monitor provides many free tools, trials, and resources for individuals and DevOps teams looking to enhance the performance of their websites, applications, APIs, servers, and infrastructure in a practical, cost-effective way.”
Dotcom-Tools: Free Website Performance Tools
“From web page speed tests and ping tests to network tracing and web application testing, the free tools we offer at Dotcom-Tools can provide users with an instant snapshot of uptime, availability, and performance of their websites, servers, applications, and network. Try all the free tools from Dotcom-Tools.”
Dotcom-Monitor: All-In-One Performance Monitoring Platform
“Step up to our professional suite of solutions to gain an even better picture of speed, availability, performance, and trends with the Dotcom-Monitor platform. The platform provides several monitoring options and packages to automate monitoring for all your web pages, applications, APIs, and servers from 30 worldwide locations. Sign up free for 30 days to begin 24/7 monitoring and receive alerts with detailed reports when errors are detected so you can minimize downtime.”
LoadView: Real Browser-Based Performance Testing from the Cloud
“LoadView provides best-in-class load testing features, such as multiple load test curves types, point and click scripting with the free EveryStep Web Recorder, and testing applications from behind the firewall. Optimize the performance of your systems and quickly identify performance bottlenecks to improve the user experience. Start your LoadView free trial today and receive $20 in load testing credits to start.”
“Learn more about all the Dotcom-Monitor solutions through our Knowledge Base articles, videos, and tutorials.”
Web Performance Engineer
Because every part of Instana’s automated APM solution is critical to handling the complexity and constant change inherent in today’s applications, Instana made sure that any Free Trial (available at https://instana.com/trial) includes ALL functionality, as if it were purchased production product – no limit on applications, transactions, locations or users. It’s the only solution to combine automation, context and intelligence to deal with the needs of multi-cloud, hybrid cloud and Cloud-Native microservice applications:
- Automatic discovery, mapping and monitoring of 300+ technologies, including 14 languages
- End-to-end distributed tracing of every request and production profiling of every process
- Real-time change detection with immediate feedback on updates, integrated with CI/CD tooling
- Automatic Root Cause Analysis – showing triggering events and likely cause of any service issues.
- Correlated inclusion of all open source observability metric and tracing sources
- Cross-correlated analytics, including metrics, traces, profiles and open source information
Trial users will want to take advantage of the unlimited seats, as Instana’s APM solution has always been popular and useful across the ranks of application stakeholders – including Developers, IT Ops, DevOps, SRE and even executives. In some clients, Instana has close to 1,000 daily users of the solution.
While the automated root cause analysis is one part of the wide usage, it’s really Instana’s Application Perspectives that make it widely popular across dev teams. This identification and grouping of specific entities (such as all the services delivered by an individual developer or a Dev Team) and the filtering of metrics, transactions and profiles to just those that touch the grouping allows every user to get the exact data they need to do their work.
Even alerting and root cause analysis is automated, making Instana the easiest solution to setup and use to make sure that your applications operate at their optimal performance.
“This year we have started to see that Observability has shifted from being a buzzword to being a key, strategic topic of interest for enterprise IT monitoring buyers, because 2020 has accelerated the need for consolidation of monitoring products. This is being driven by increasing cloud adoption and cloud-first strategies, that are leading to greater application, system and IT Infrastructure complexity, meaning that contextual, holistic insight into performance and availability has become a must have and no longer a nice to have.
Simply, multiple, siloed monitoring products equals data that is out of context in regards to overall IT service performance and therefore can’t easily be utilised strategically to make fast, accurate decisions or to trigger proactive automation. The principles of Observability counteract this, by focusing monitoring products on a platform approach that is able to collect metric, application and log data in one platform, and utilize AI/ML to automate insight e.g. Root Cause Analysis and pre-emptive, dynamic capacity management.
At LogicMonitor, we believe that the next evolution of IT monitoring capabilities focuses on ‘Unified Observability’ or one platform for the collection and automatic contextual analysis of metric, application and log data. It’s why our recent release of LM Logs is fully integrated with our ability to monitor IT Infrastructure technologies. You can read more about this here and download a free trial of our platform here.”
VP, Product Marketing and Sales Enablement
Lumigo is a serverless monitoring and debugging platform. Our goal is to save developers time by shortening the cycle from issue discovery to fix. We do this with our deep distributed tracing of all AWS managed services, and by correlating the relevant data so you don’t have to go around chasing for it with tedious and repetitive manual work. And everything is searchable in a super-intuitive way.
In addition to the ease of finding and fixing problems, developers love that Lumigo is so easy to set up. It literally takes 4 minutes and 4 clicks, with no code changes or configuration.
Lumigo offers the most generous free tier in the industry: 1M invocations and 500K traced invocations (when you enter the promo code Hayden500). This free tier has no functional limitations. We want to make it easy for developers during the dev phase or when it’s still a small project, and only pay when Lumigo brings real commercial value. You can sign up for your free account at https://platform.lumigo.io/signup.
Lumigo has also shared with the serverless community the free, open-source Lumigo CLI tool, a “Swiss army knife” for various AWS serverless related tasks, such as analyze-lambda-cold-starts, replay-sqs-dlq, and powertune-lambda. You can get it on npm.
Co-founder and CEO
“Netdata is an open-source, infrastructure monitoring solution built for troubleshooting. It monitors the performance and health of systems and applications in real-time. The approach is simple but unparalleled: we offer zero-configuration monitoring with unlimited metrics and data visualizations that are organized and built for troubleshooting, providing increased observability of your infrastructure, all for free. Netdata is different from other solutions because it doesn’t backhaul data which can limit the number and frequency of metrics collected. Instead, Netdata’s distributed data architecture keeps metrics resident on their originating systems and only streams metrics on demand to provide a real-time view of the entire infrastructure at a per-second granularity. An important element is our dedicated community that has supported their accelerated growth by contributing their time and expertise to resolve issues, provide feedback, and even assist in developing new features.
With Netdata, you get access to unlimited, highly granular metrics all in real-time, prebuilt dashboards, and intelligent alarms that provide complete visibility and insights to help you scale, for free. Get up and running within minutes with their one-line deployment that requires zero configuration. Immediately access prebuilt charts and alarms with intelligent defaults to help you realize value immediately. Netdata is also designed to be open and interoperable with other solutions in your monitoring toolchain. Integrate with any number of prebuilt technologies to instantly collect thousands of metrics from systems and apps.
– – – –
Senior Product Manager
“Powerful monitoring tools should help developers quickly find and fix performance abnormalities without having to deal with the headache of enterprise-platform feature bloat. Scout APM was designed specifically for users to gain instant observability of your apps. This tool provides out of the box instrumentation of all web endpoints, background jobs, as well as many common libraries.
Scout APM offers a free plan to any open-source project, all you have to do is send them the Github link and testimonial to gain immediate insights. Additionally, Scout APM offers a free 14-day trial to all users, regardless of your plan, to give users an idea of how app monitoring can drastically improve the end-user experience.”
– – – –
“Sematext offers 4 solutions
Infrastructure and Application Performance Monitoring (think metrics), Log Monitoring, Frontend Monitoring (aka User Experience or Real User Monitoring), and Synthetic Monitoring (aka Website or API Monitoring). Each solution has its own independent 14-day free trial period, so you are not forced to rush and evaluate everything Sematext has to offer in 14 days.
Infrastructure and Logs Monitoring solutions each have a free plan. One can change plans at any time. You can downgrade from a paid plan to a free plan and vice versa. No interactions with any salespeople are needed. As a matter of fact, as of this writing Sematext has exactly 0 sales people. Onboarding is intuitive and self-service all the way.
All plans and their prices are easily visible on the Sematext pricing page. Other vendors have account-level plans, data retention, etc. which means you pay for every single thing you send to their service.
Sematext is different.
With Sematext you can mix and match plans, so you can use free plans for monitoring some things and paid plans for monitoring other things. This gives our users lots of control over their costs for all different types of monitoring accessible via Sematext’s unified, full-stack observability platform.”
Sematext Group, Inc.
Observability Leaders in 2021 and Beyond.
There are loads of providers to choose from; most tout elaborate GUIs, proprietary software, and emerging technologies that users need to master. Companies seeking better monitoring and observability need to be able to narrow down their top choices quickly—weeks, rather than spending months on scheduled demos and sales calls. Observability providers need to empower decision-makers with access that enables them to freely examine, compare and test their platforms.
Observability organizations shifting to allow more free engagement with their platforms will be best positioned in 2021 and beyond to emerge as leaders in this market. Thus, allowing developers to test and explore options during their platform selection process without committing to any particular provider.
20 Recommended Premium Observability and Monitoring Platforms.
- Appoptics – Infrastructure and application performance monitoring.
- Appdynamics – Full-stack observability to drive business decisions.
- Aternity – Digital Experience Management, and more.
- Broadcom – AIOps and Observability.
- Datadog – Cloud Monitoring as a Service.
- Dynatrace – Automatic and intelligent observability.
- Epsagon – Modern Observability for Modern Applications.
- Fortinet – Strengthens NetSec Ops with SaaS-based monitoring.
- Instana – APM Observability sandbox. (being acquired by IBM)
- Instrumental – Easy-to-use application and server monitoring.
- Lightstep – Full-context observability.
- Logz.io – ELK, open-source Grafana, and Jaeger on a scalable platform.
- ManageEngine – 90+ Products and free tools.
- New Relic – Observability made simple.
- Oracle – Cloud Observability and Management Platform. (new!)
- Sematext – Troubleshooting made easy with Sematext Cloud.
- Site24x7 – All-in-One Monitoring Solution.
- Solarwinds – Observability tools and solutions.
- Splunk – Observability suite.
- Sumo Logic – Modern App Reliability with Sumo Logic Observability.
Free and Open-Source Observability Solutions.
Observability and its capabilities continue to expand at an accelerating rate. Yet, to many, the term itself is still relatively new. As you’ve read above, organizations are now working to explain and market their observability platforms. Several offer free tools, resources, open-source solutions, and even free-tier plans.
20 Open-Source and Free Observability Solutions
- Cilium – eBPF-based Networking, Security, and Observability.
- Cloudprober – Active monitoring software to detect failures before your customers do.
- Datav – Observability for metrics/logs/traces/big-screens
- Elastic Stack – Take data from any source, in any format, then search, analyze, and visualize it in real-time.
- Express Status Monitor – Real-time monitoring solution for Node.js/Express.js apps.
- Fluentd – Unified Logging Layer
- Grafana – the open observability platform.
- Graphite – A highly scalable real-time graphing system.
- Icinga – Monitor Your Entire Infrastructure.
- Influxdata – Scalable datastore for metrics, events, and real-time analytics
- Jaeger – Open-source, end-to-end distributed tracing.
- Kibana – Visualize your Elasticsearch data and navigate the Elastic Stack.
- Kubesphere – Enterprise-grade container platform tailored for multi-cloud and multi-cluster management.
- Netdata – Open-source monitoring.
- Opstrace – Secure observability, deployed in your own network.
- Prometheus – Monitoring system and time series database.
- Sentry – Open-source error tracking with full-stack traces & asynchronous context.
- Skywalking – Application performance monitor tool for distributed systems.
- Vector – A high-performance, end-to-end observability data platform.
- Zipkin – Distributed tracing system.