PDF fund data extraction, validation and conversion to Parquet file support for leading Fund of Funds

  • 35%

    Faster turnaround time with increased accuracy

  • ~60%

    Enhanced bandwidth for onshore teams

  • ~$3M

    Annualized cost savings


CLIENT CHALLENGES

  • Inefficient portfolio data management and analysis
  • Cost-heavy data operations and manual workflows
  • Manual data extraction leading to inconsistencies and scaling challenges

OUR APPROACH

  • Developed fund data extraction framework for efficiency gains
  • Automated fund data extraction from PDF documents into predefined excel templates, based on data mapping provided by Acuity Data Ops team for new and existing funds
  • Automated data validation for missing data, data type discrepancies and highlight cells without data in the template to inform the Acuity Data Ops team
  • Implemented 3-Tier quality checks during the entire data journey
  • Developed, maintained and enhanced framework to convert excel data template files into Parquet files and upload into AWS S3 buckets post validation

IMPACT DELIVERED

  • 5,300+ funds coverage, 30K+ underlying portfolio companies, 5M+ KPIs updated annually, 50K+ fund documents managed
  • Implemented a scalable solution taking into account the existing infrastructure and models.
  • Robust solution with high quality output, driven by subject matter experts and rigorous quality check process.
Thank you for sharing your details

Your file will start downloading automatically

If it does not download within 1 minute,

Share this on