At CodeValue, we specialize in developing innovative software solutions that address our clients' unique challenges. Recently, we had the opportunity to partner with SeeMore, an end-to-end data product optimization platform that focuses on maximizing data ROI, attributing data spend, and eliminating cost spikes across various pipelines and stacks. Our task was to enhance SeeMore's ability to understand and optimize their complex data processes by generating automated descriptions, ultimately supporting their mission to drive growth through data-driven insights.
SeeMore required a solution to automatically generate detailed descriptions for large SQL tables, SQL jobs, and data flow lineage from their databases. Their key goals included improving data transparency, making it easier to understand complex data transformations, and optimizing their data workflows.
To address SeeMore's needs, we developed three key capabilities:
To make these capabilities feasible within the constraints of LLMs, we implemented a splitting algorithm that intelligently breaks down large datasets and descriptions to fit within the LLM context window. This ensures that even the most extensive data can be processed effectively.
A critical aspect of our solution was the introduction of a rate limiter to manage the usage of LLMs. This mechanism not only helps SeeMore control operational costs but also ensures that their system remains efficient and responsive under varying loads.
Furthermore, we developed the capability to utilize multiple model providers, This flexibility allows SeeMore to select the most suitable model for their specific needs, ensuring optimal performance and cost-effectiveness.
Additionally, we created evaluation tests for the models to continuously assess their performance and make necessary adjustments to the code and prompts. This iterative approach ensures that the models remain accurate and aligned with SeeMore's evolving requirements.
Our work with SeeMore showcases how advanced Gen AI integration can transform complex data environments. By enabling automated descriptions of SQL tables, jobs, and data flow lineage, we’ve helped SeeMore achieve greater transparency and efficiency in their operations. The introduction of a splitting algorithm, rate limiter, multi-model support, and continuous model evaluation further underscores our commitment to delivering tailored, cost-effective solutions that meet our clients' evolving needs.
"Working with Code value we found a professional partner who quickly understand our needs and is able to deliver the capabilities we need."
Ariel Utnik, CEO & Co-Founder, Seemore Data