Open In App

CDN Analytics with Azure Data Explorer

Azure CDN is a distributed network of servers that delivers online information to consumers quickly and effectively. To reduce latency, CDNs cache content on edge servers located near end users. Often, a CDN is used for distributing static resources to client applications, which is typically done via a website. The application might produce things at runtime and make them accessible to the CDN (for example, by building a list of current news headlines), but it does not do so for each request. 

CDN Logs:

Content Delivery Networks produce vast volumes of log Files when they transmit video over the internet to our homes and Mobile devices. These logs include essential information regarding the operation of the CDN servers and the quality of the video streaming. These logs range into terabytes of data and managing this amount of data in real-time and applying analytics to understand customer experience and network faults needs the right tools.



Depending on which CDN provider you use, accessing the logs will be different:

Azure Data Explorer allows you to ingest raw logs and traces and analyze the data by parsing, filtering, and aggregating the data very performantly. This example will show you the sort of information that is generally accessible in your CDN logs and explain how to analyze them:



90.54.15.123 username [23/Dec/2021:13:55:36 +0000] 
“GET /photo.jpg HTTP/2.0” 200 132 3445

CDN Analytics With Azure Data Explorer:

This solution illustrates low-latency high-throughput ingestion for massive amounts of CDN logs for constructing near real-time analytics dashboards.

 

  1. CDN providers such as Verizon and Fastly ingest massive volumes of CDN logs into ADX to assess the latency, health, and performance of CDN assets. 
     
  2. Most CDN implementations import data using Azure Storage (Blob or Azure Data Lake Storage Gen2), which uses Azure Event Grid and starts the ingestion pipeline to ADX. Alternatively, you can bulk ingest the data using the LightIngest tool. You can also continually export data to Azure Storage in compressed, partitioned Parquet format and easily query that data.
     
  3. ADX enables easy-to-use native operators and functions to process, aggregate, and analyze time series and log data, as well as give insights at lightning speed. You can develop near real-time analytics dashboards using ADX dashboards, Power BI, or Grafana. 
     
  4. Create and schedule alerts and notifications using an ADX connection for Azure Logic Apps.

CDN logs are sources of information that can help you better understand your users’ behavior, your company’s performance, and the frequency of fraudulent requests that come to your website. These insights are essential for learning and developing your service in order to scale safely and accomplish your goals. ADX provides an innovative query language, that is optimized for high-performance data analytics and is perfect for analyzing massive amounts of raw unstructured logs. If you have raw data already collected by logging systems, to demonstrate the real value that can be extracted from semi-structured data, ADX can be used to explore the raw data. In the final chapter, we will see how ADX can serve as a centralized analytics platform for logs. 

Article Tags :