Fortune 500 company entertaining audiences for over two decades

Improving data turnaround by 80% with Databricks

7 November 2022 | Noor Khan

Key Challenges

Our client was struggling with slow turnaround on data reporting. Reports took a significant amount of time to produce because of delays in data processing.

Key Details

Service: Data Engineering
Technology: Databricks, PySpark, PagerDuty, CloudWatch
Industry: Media
Sector: Broadcasting

Key results

  • 80% quicker data reporting time
  • Expert recommendations on suitable technologies
  • 215 million rows of data processed hourly
  • Cost efficiency
  • Continuous improvements and optimisation

Leading entertainment Fortune 500 company

Headquartered in California, our client is a well-established Fortune 500 company worth several billion as of October 2022. They handle large volumes of broadcasting data, including audience and commercial data. We have worked with them on a number of projects to help unlock the potential of their data by continuously improving and optimising data performance.

Data delays and slow reporting

Our client deals with huge volumes of data and was experiencing delays in reporting. They were running around 80 reports, and each report took around 4 to 5 minutes to produce. Across the full set of 80 reports, these per-report delays add up to a considerable amount of time. Our client was therefore looking for an alternative solution that would significantly improve data reporting turnaround.
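To put these figures in perspective, the short Python sketch below works through the arithmetic. It assumes, purely for illustration, that the 80 reports run one after another and that the 80% improvement quoted in this case study applies evenly to every report. Neither assumption is stated in the project details, and the function name is hypothetical.

```python
# Illustrative arithmetic only.
# Assumptions (not stated in the case study): reports run sequentially,
# and the 80% improvement applies evenly to every report.

def total_minutes(report_count: int, minutes_per_report: int) -> int:
    """Total time if reports are produced one after another."""
    return report_count * minutes_per_report

REPORTS = 80                    # approximate report count from the case study
BEFORE_LOW, BEFORE_HIGH = 4, 5  # minutes per report from the case study
REDUCTION_PERCENT = 80          # headline improvement from the case study

before_low = total_minutes(REPORTS, BEFORE_LOW)    # 320 minutes
before_high = total_minutes(REPORTS, BEFORE_HIGH)  # 400 minutes

# Time remaining after an 80% reduction (integer percentages avoid float drift).
after_low = before_low * (100 - REDUCTION_PERCENT) / 100    # 64.0 minutes
after_high = before_high * (100 - REDUCTION_PERCENT) / 100  # 80.0 minutes

print(f"Before: {before_low} to {before_high} minutes")
print(f"After:  {after_low:.0f} to {after_high:.0f} minutes")
```

Under these assumptions, a full reporting run of roughly five to seven hours would drop to a little over an hour.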

Improving data turnaround by 80% with Databricks

Our client's existing data infrastructure was built on Amazon Redshift. Although Redshift is a powerful technology, it introduced processing delays for our client's data. We therefore recommended Databricks as an alternative for processing data quickly and efficiently. Databricks offered scalable, efficient and faster data processing through independent clusters that can run in parallel.

Databricks clusters

Our highly experienced data engineers created three clusters on Databricks: Cluster A for storing all data, Cluster B for running ETL, and Cluster C for handling any issues and delays, so that affected workloads could be moved to a separate cluster and processed in parallel. One of the biggest benefits of Databricks is the ability to create as many clusters as required to process data in parallel. This made data processing much faster and more efficient, improving data reporting speed by 80%.
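As a rough sketch of what a three-cluster layout like this might look like in configuration, the Python below builds request bodies in the shape used by the Databricks Clusters API (`POST /api/2.0/clusters/create`). Nothing is sent over the network. The cluster names, runtime version, node type and the worker counts for Clusters A and C are placeholders rather than details from the project, and treating Cluster B's 16 nodes as 16 workers is also an assumption.

```python
# Hypothetical sketch: request bodies shaped for the Databricks Clusters API.
# All names and values are placeholders except Cluster B's 16 nodes,
# which comes from the case study (treated here as 16 workers, an assumption).
import json

SPARK_VERSION = "<databricks-runtime-version>"  # placeholder, not from the source
NODE_TYPE = "<node-type-id>"                    # placeholder, not from the source

def cluster_spec(name: str, num_workers: int) -> dict:
    """Build one cluster definition using standard Clusters API field names."""
    return {
        "cluster_name": name,
        "spark_version": SPARK_VERSION,
        "node_type_id": NODE_TYPE,
        "num_workers": num_workers,
    }

specs = [
    cluster_spec("cluster-a-storage", 2),   # worker count is a placeholder
    cluster_spec("cluster-b-etl", 16),      # 16 nodes per the case study
    cluster_spec("cluster-c-recovery", 2),  # worker count is a placeholder
]

for spec in specs:
    print(json.dumps(spec))
```

Because each definition is independent, adding a fourth cluster for an overflow workload would be one more `cluster_spec` call, which reflects the parallelism benefit described above.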

Find out more about our Databricks partnership.

215 million rows of data processed hourly

Cluster B, where the ETL process runs, has 16 nodes and processes a huge amount of data. Approximately 9 million rows of viewing data and 215 million rows of commercial data are processed every hour, around the clock, every day.
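The scale of these volumes is easier to see with some quick arithmetic. The sketch below derives per-second, per-day and per-node rates from the hourly figures. It assumes the quoted row counts are steady averages and that work is spread evenly across the 16 nodes, neither of which is stated in the project details.

```python
# Illustrative throughput arithmetic from the case study's approximate figures.
# Assumptions: hourly volumes are steady averages, and load is spread
# evenly across all 16 nodes of Cluster B.

VIEWING_ROWS_PER_HOUR = 9_000_000
COMMERCIAL_ROWS_PER_HOUR = 215_000_000
ETL_NODES = 16

total_per_hour = VIEWING_ROWS_PER_HOUR + COMMERCIAL_ROWS_PER_HOUR  # 224,000,000
rows_per_second = total_per_hour / 3600          # about 62,222
rows_per_day = total_per_hour * 24               # 5,376,000,000
rows_per_node_hour = total_per_hour // ETL_NODES # 14,000,000

print(f"Rows per hour:          {total_per_hour:,}")
print(f"Rows per second:        {rows_per_second:,.0f}")
print(f"Rows per day:           {rows_per_day:,}")
print(f"Rows per node per hour: {rows_per_node_hour:,}")
```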

Errors and optimisation

Because data streams flow constantly, our engineers provide operational monitoring and support to spot errors, resolve issues and continuously recommend ways to improve and optimise data performance. PagerDuty is used for error alerts, which are then resolved by the Ardent data engineers.
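This case study does not describe how PagerDuty is wired into the pipeline. As a minimal sketch, assuming alerts are raised through the PagerDuty Events API v2, a failed job could build and send a trigger event as shown below. The routing key, summary and source values are hypothetical placeholders, and `send_event` is defined but deliberately not called.

```python
# Hypothetical sketch of raising an alert via the PagerDuty Events API v2.
# The routing key, summary and source values are placeholders.
import json
import urllib.request

PAGERDUTY_EVENTS_URL = "https://events.pagerduty.com/v2/enqueue"

def build_trigger_event(routing_key: str, summary: str, source: str,
                        severity: str = "error") -> dict:
    """Build an Events API v2 trigger event.

    severity must be one of: critical, error, warning, info.
    """
    return {
        "routing_key": routing_key,
        "event_action": "trigger",
        "payload": {
            "summary": summary,
            "source": source,
            "severity": severity,
        },
    }

def send_event(event: dict) -> int:
    """POST the event to PagerDuty and return the HTTP status code."""
    request = urllib.request.Request(
        PAGERDUTY_EVENTS_URL,
        data=json.dumps(event).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status

# Build (but do not send) an example alert for a failed ETL run.
event = build_trigger_event(
    routing_key="<integration-routing-key>",  # placeholder
    summary="ETL job failed on Cluster B",
    source="cluster-b-etl",
)
print(json.dumps(event, indent=2))
```

Keeping event construction separate from sending makes the alert payload easy to check without contacting PagerDuty.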

Ardent ongoing monitoring and support

Overall, our client has significantly reduced data processing and reporting time by adopting Databricks. This brings many benefits, from improved productivity to a better data turnaround time for end clients. The operational monitoring and support give them peace of mind, as any errors and issues that arise are resolved quickly and efficiently. Additionally, the Ardent project team and the client's data science team meet regularly to discuss progress and optimisation suggestions.

Explore Ardent data engineering services.

