7 November 2022 | Noor Khan
Headquartered in California, our client are a well-established Fortune 500 company worth over a few billion as of October 2022. They deal with a large scale of various broadcasting data including audience and commercial data. We have worked with them on a number of projects to help unlock the potential of data by continuously improving and optimising data performance.
Our client deal with huge volumes of data and were having delays in their reporting. They were running around 80 reports and each report took around 4 to 5 minutes to be produced. The data reports delay of each report adds a considerable amount of time to the reporting time of the full 80 reports. Therefore, our client were looking for an alternative solution to significantly improve their data reporting turnaround.
Our clients existing data structure was built on Amazon Redshift, which is a powerful technology, however, it presented data delay challenges for our clients' data. Therefore, we recommend Databricks as the alternative to processing data quickly and efficiently. Databricks offered scalable, efficient and quicker processing of data with the use of independent clusters which can run parallel.
Databricks clusters
Our highly experienced data engineers created three clusters on Databricks including Cluster A for storing all data, Cluster B to set up ETL, and Cluster C for any issues and delays, which could then be moved to a new cluster to enable parallel processing. One of the biggest benefits on offer with data bricks is the ability to create as many clusters as required to process data in parallel. This enabled much more efficient and quick processing of data improving data reporting speed by 80%.
Find out more about Databricks partnership.
215 million rows of data processed hourly
Cluster B where the ETL process is running had 16 nodes and a huge amount of data is being processed. Approximately 9 million rows of viewing data and 215 million rows of commercial data are processed on an hourly basis and around the clock, every day.
Errors and optimisation
As the streams of data are constantly flowing, our engineers provide operational monitoring and support to spot errors, resolve issues and continuously make recommendations to improve and optimise data performance. PagerDuty is employed for error alerts, which are then resolved by the Ardent data engineers.
Overall, our clients can significantly reduce the data processing and reporting time with the adoption of Databricks. This offers them many benefits from improving productivity to a better data turnaround time for end clients. They have peace of mind with the operational and monitoring support as any errors and issues that may arise will be resolved quickly and efficiently. Additionally, both the Ardent team for this project and the client's data science team have regular meetings to discuss progress and optimisation suggestions.
Explore Ardent data engineering services.
A market leader, internationally renowned media and broadcasting company Founded in 2002, our client has been around for over two decades and is an internationally known company dealing with broadcasting data for commercial use. With a mission of making high-quality technology and content affordable for everyone, they have established themselves as a market leader. [...]
Read More... from Fortune 500 company entertaining audiences for over two decades
Leader logistics software provider Our client is a leading logistics software provider in the UK. With over 3 decades of experience in the industry, they continuously look to innovate with technology. Their range of software products includes a warehouse management system and removal management software. They aim to remove the complexity of software and bring [...]
Read More... from Fortune 500 company entertaining audiences for over two decades
Well-established logistics software provider Our client is a software products company providing software to the logistics industry and their main product was administration solution software for removal companies. With almost three decades of experience, our clients are leaders in the removals sector. Since the start, they have gone from strength to strength in becoming a [...]
Read More... from Fortune 500 company entertaining audiences for over two decades
Digital transformation is the process of modernizing and digitating business processes with technology that can offer a plethora of benefits including reducing long-term costs, improving productivity and streamlining processes. Despite the benefits, research by McKinsey & Company has found that around 70% of digital transformation projects fail, largely down to employee resistance. If you are [...]
Read More... from Fortune 500 company entertaining audiences for over two decades
Protocols and guidelines are at the heart of data engineering and application development, and the data which is sent using network protocols is broadly divided into stateful vs stateless structures – these rules govern how the data has been formatted, how it sent, and how it is received by other devices (such as endpoints, routers, [...]
Read More... from Fortune 500 company entertaining audiences for over two decades
Data observability is all about the ability to understand, diagnose, and manage the health of your data across multiple tools and throughout the entire lifecycle of the data. Ensuring that you have the right operational monitoring and support to provide 24/7 peace of mind is critical to building and growing your company. [...]
Read More... from Fortune 500 company entertaining audiences for over two decades