Classify It: Developing a Classification Engine for Data Loss Prevention
Robin Franklin Guha ● April 29, 2025
This session covers how Meta built a multi-stage classification engine for data loss prevention that uses traditional machine learning and Llama to classify unstructured text. The engine has classified more than 100 million files across internal document-sharing platforms and blob storage. Experts will walk through the problem end to end, from collecting labeled data to model development, testing, and deployment.