Wednesday, June 5, 2024
11:45 AM - 12:30 PM
Intermediate
In the era of growing unstructured data and advanced Large Language Models (LLMs), the risk of sensitive data exposure has become a paramount concern for enterprises. As reported in a study by Veritas Technologies, with an average of 17.5 petabytes of unstructured data, of which approximately 55% is classified as dark data, organizations are increasingly challenged with safeguarding data in AI interactions. In this talk, we will present a solution architecture that we have implemented that integrates AI-driven data classification, robust access controls, and compliance mechanisms. We will describe how this approach enhances data security, ensures AI compliance, and streamlines sensitive data management while boosting operational efficiency and risk mitigation. We will also discuss why it is pivotal for organizations seeking to leverage AI capabilities responsibly in the LLM era to adopt this framework.
Implementing a comprehensive data security solution is essential for modern enterprises coping with vast amounts of sensitive information. The success of our solution demonstrates that a hybrid approach combining deterministic and probabilistic methods, such as pattern matching and advanced AI/ML models, is effective in identifying and flagging sensitive data with overly permissive access controls. Our approach underlines the importance of a sensitivity rules database, categorized into Core, Common, and Unique Rules, to tailor data protection strategies to the specific needs of an organization. Join us in this session to learn more about this extensible framework that organizations can implement to secure their sensitive data in an increasingly complex digital environment.
Urmi Majumder is a Principal Consultant in the Advanced Data and Enterprise AI practice at Enterprise Knowledge where she leads system architecture, design, and implementation of a broad range of enterprise solutions. She has 15 years of experience leading the development of technical solutions in support of a wide variety of federal and commercial clients by integrating open-source, SaaS, and COTS tools, as well as establishing the connection between these tools and their business users. Her diverse portfolio includes the design and development of data-centric solutions including content management systems, record management systems, knowledge portals, search applications, semantic applications, data catalogs, and AI/ML applications, both in the context of new system development and data modernization efforts.