Monetizing broadcasting data

Ensuring timely data availability for real time, mission critical data

24 February 2023 | Noor Khan

Ensuring timely data availability for real time mission critical data

Key Challenges

Our clients were facing regular delays in their reporting as a result of data delays and gaps. Dealing with real-time, mission-critical data, they required a supercharged ETL structure to considerably improve the data reporting speed.

Key Details

Service

Data Engineering

Technology

Databricks, AWS Airflow, Spark, AWS S3, SFTP (Secure File Test Protocol)

Industry

Media

Sector

Media

Key results

  • Significantly reduced data reporting time
  • Parallel processing of multiple clusters
  • High data reliability and availability
  • Robust data security measures and practices
  • Automated error resolving

A market leader, internationally renowned media and broadcasting company

Founded in 2002, our client has been around for over two decades and is an internationally known company dealing with broadcasting data for commercial use. With a mission of making high-quality technology and content affordable for everyone, they have established themselves as a market leader.

Date reporting delays

The client deal with real-time broadcasting and commercial data, and the data needs to be available in near real-time. Their existing ETL infrastructure built on AWS technologies such as Redshift was not able to meet their expectations and requirements. Our highly experienced engineers presented the solution of leveraging Databricks for the whole ETL process for the many benefits it offers including parallel processing with multiple clusters.

Immense real-time data

The client was dealing with considerably large datasets including the likes of 215 million records of commercial data being processed on an hourly basis, and 9 million records of content data being processed hourly. This data created eighty reports which need to be produced quickly and efficiently.

Ensuring timely data availability for real time mission critical data

Parallel processing on multiple clusters with Databricks

To significantly improve the speed of data availability and reporting, the client required a solution which would enable the parallel processing of multiple data clusters. The ETL pipeline infrastructure was built in line with customer requirements and the data flowed through multiple stages:

  • Raw data, extracted from the source
  • Staging
  • Silver
  • Gold
Ensuring timely data availability for real time mission critical data

Each stage carried out the processing tasks such as de-duplication, validation and cleansing to ensure the data is loaded to the destination was clean, without gaps and delays.

Why Databricks?

Databricks is a brilliant technology used for ETL processes with user-friendly dashboards to track the entire process and spot any errors. Our highly skilled data engineering team are proficient in Databricks and has utilised it for many client projects, therefore they were able to make the recommendations to help the client overcome their challenges.

Automated error resolving and reporting

The solution was built with automated error resolution in place which is offered by Databricks. Our engineers were able to set a number of tries that the system would make before the error was reported for manual intervention. This helps the data engineering team ensure that there are no data drops and delays and errors can be resolved quickly and efficiently. There are two main errors that there is a potential for occurring and these include:

  • Data drops – This would mean the batch processing is incomplete therefore the processing of that particular data batch would be repeated.
  • Data gaps – In this case, there are gaps in the data which would require a backfill to when the error first occurred.
Ensuring timely data availability for real time mission critical data

Ongoing operational monitoring and support of the ETL

Our client are thrilled with the high performance of the new and improved ETL infrastructure built on Databricks which is driving a new speed and efficiency for their reporting. Our expert data engineering continues to provide an ongoing operational monitoring and support service to the client to ensure data availability and accessibility at all times.

Ensuring timely data availability for real-time, mission-critical data with Ardent data engineering service

Ardents' team of highly skilled data engineers are proficient in world-leading data technologies and can make recommendations based on your unique needs and requirements. Whether you have a preferred tech stack or want expert guidance on the technologies right for your data and business, we can help. Are you facing any of these challenges:

  • Delayed data reporting turnaround
  • Data delays, gaps and dropouts
  • Slow data performance and speed

If you are, get in touch today to find out how we can help you unlock the potential of your data.


More Success Stories

Fine art storage & preservation software

Success Story

Making logistics simple

Logistics | Logistics, Software

Leader logistics software provider Our client is a leading logistics software provider in the UK. With over 3 decades of experience in the industry, they continuously look to innovate with technology. Their range of software products includes a warehouse management system and removal management software. They aim to remove the complexity of software and bring [...]

Read More... from Monetizing broadcasting data

warehouse management automation user-friendly app

Success Story

Three decades of experience in delivering software excellence

Technology | Logistics, Software

Well-established logistics software provider Our client is a software products company providing software to the logistics industry and their main product was administration solution software for removal companies. With almost three decades of experience, our clients are leaders in the removals sector. Since the start, they have gone from strength to strength in becoming a [...]

Read More... from Monetizing broadcasting data

Improving data turnaround by 80% with Databricks 4

Success Story

Fortune 500 company entertaining audiences for over two decades

Media | Broadcasting

Leading entertainment Fortune 500 company Headquartered in California, our client are a well-established Fortune 500 company worth over a few billion as of October 2022. They deal with a large scale of various broadcasting data including audience and commercial data. We have worked with them on a number of projects to help unlock the potential [...]

Read More... from Monetizing broadcasting data

Ardent Insights

Are you ready to take the lead in driving digital transformation?

Are you ready to take the lead in driving digital transformation?

Digital transformation is the process of modernizing and digitating business processes with technology that can offer a plethora of benefits including reducing long-term costs, improving productivity and streamlining processes. Despite the benefits, research by McKinsey & Company has found that around 70% of digital transformation projects fail, largely down to employee resistance. If you are [...]

Read More... from Monetizing broadcasting data

Stateful vs Stateless

Stateful VS Stateless – What’s right for your application?

Protocols and guidelines are at the heart of data engineering and application development, and the data which is sent using network protocols is broadly divided into stateful vs stateless structures – these rules govern how the data has been formatted, how it sent, and how it is received by other devices (such as endpoints, routers, [...]

Read More... from Monetizing broadcasting data

Getting data observability done right - Is Monte Carlo the tool for you (1)

Getting data observability done right – Is Monte Carlo the tool for you?

Data observability is all about the ability to understand, diagnose, and manage the health of your data across multiple tools and throughout the entire lifecycle of the data. Ensuring that you have the right operational monitoring and support to provide 24/7 peace of mind is critical to building and growing your company. [...]

Read More... from Monetizing broadcasting data