Summary<br />In this episode of the AI Engineering Podcast Jim Olsen, CTO of ModelOp, talks about the governance of generative AI models and applications. Jim shares his extensive experience in software engineering and machine learning, highlighting the importance of governance in high-risk applications like healthcare. He explains that governance is more about the use cases of AI models rather than the models themselves, emphasizing the need for proper inventory and monitoring to ensure compliance and mitigate risks. The conversation covers challenges organizations face in implementing AI governance policies, the importance of technical controls for data governance, and the need for ongoing monitoring and baselines to detect issues like PII disclosure and model drift. Jim also discusses the balance between innovation and regulation, particularly with evolving regulations like those in the EU, and provides valuable perspectives on the current state of AI governance and the need for robust model lifecycle management.<br /><br /><br />Announcements<br /><ul><li>Hello and welcome to the AI Engineering Podcast, your guide to the fast-moving world of building scalable and maintainable AI systems</li><li>Your host is Tobias Macey and today I'm interviewing Jim Olsen about governance of your generative AI models and applications</li></ul>Interview<br /><ul><li>Introduction</li><li>How did you get involved in machine learning?</li><li>Can you describe what governance means in the context of generative AI models? (e.g. governing the models, their applications, their outputs, etc.)</li><li>Governance is typically a hybrid endeavor of technical and organizational policy creation and enforcement. From the organizational perspective, what are some of the difficulties that teams are facing in understanding what those policies need to encompass?<ul><li>How much familiarity with the capabilities and limitations of the models is necessary to engage productively with policy debates?</li></ul></li><li>The regulatory landscape around AI is still very nascent. Can you give an overview of the current state of legal burden related to AI?<ul><li>What are some of the regulations that you consider necessary but as-of-yet absent?</li></ul></li><li>Data governance as a practice typically relates to controls over who can access what information and how it can be used. The controls for those policies are generally available in the data warehouse, business intelligence, etc. What are the different dimensions of technical controls that are needed in the application of generative AI systems?<ul><li>How much of the controls that are present for governance of analytical systems are applicable to the generative AI arena?</li></ul></li><li>What are the elements of risk that change when considering internal vs. consumer facing applications of generative AI?<ul><li>How do the modalities of the AI models impact the types of risk that are involved? (e.g. language vs. vision vs. audio)</li></ul></li><li>What are some of the technical aspects of the AI tools ecosystem that are in greatest need of investment to ease the burden of risk and validation of model use?</li><li>What are the most interesting, innovative, or unexpected ways that you have seen AI governance implemented?</li><li>What are the most interesting, unexpected, or challenging lessons that you have learned while working on AI governance?</li><li>What are the technical, social, and organizational trends of AI risk and governance that you are monitoring?</li></ul>Contact Info<br /><ul><li><a href="https://www.linkedin.com/in/jimolsen/" target="_blank">LinkedIn</a></li></ul>Parting Question<br /><ul><li>From your perspective, what are the biggest gaps in tooling, technology, or training for AI systems today?</li></ul>Closing Announcements<br /><ul><li>Thank you for listening! Don't forget to check out our other shows. The <a href="https://www.dataengineeringpodcast.com" target="_blank">Data Engineering Podcast</a> covers the latest on modern data management. <a href="https://www.pythonpodcast.com" target="_blank">Podcast.__init__</a> covers the Python language, its community, and the innovative ways it is being used.</li><li>Visit the <a href="https://www.aiengineeringpodcast.com" target="_blank">site</a> to subscribe to the show, sign up for the mailing list, and read the show notes.</li><li>If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.</li><li>To help other people find the show please leave a review on <a href="https://podcasts.apple.com/us/podcast/the-machine-learning-podcast/id1626358243" target="_blank">iTunes</a> and tell your friends and co-workers.</li></ul>Links<br /><ul><li><a href="https://www.modelop.com/" target="_blank">ModelOp</a></li><li><a href="https://en.wikipedia.org/wiki/Foundation_model" target="_blank">Foundation Models</a></li><li><a href="https://en.wikipedia.org/wiki/General_Data_Protection_Regulation" target="_blank">GDPR</a></li><li><a href="https://www.europarl.europa.eu/topics/en/article/20230601STO93804/eu-ai-act-first-regulation-on-artificial-intelligence" target="_blank">EU AI Regulation</a></li><li><a href="https://www.llama.com/llama2/" target="_blank">Llama 2</a></li><li><a href="https://aws.amazon.com/bedrock/" target="_blank">AWS Bedrock</a></li><li><a href="https://en.wikipedia.org/wiki/Shadow_IT" target="_blank">Shadow IT</a></li><li><a href="https://en.wikipedia.org/wiki/Retrieval-augmented_generation" target="_blank">RAG == Retrieval Augmented Generation</a><ul><li><a href="https://www.aiengineeringpodcast.com/retrieval-augmented-generation-implementation-episode-34" target="_blank">Podcast Episode</a></li></ul></li><li><a href="https://github.com/NVIDIA/NeMo" target="_blank">Nvidia NEMO</a></li><li><a href="https://www.langchain.com/" target="_blank">LangChain</a></li><li><a href="https://shap.readthedocs.io/en/latest/example_notebooks/overviews/An%20introduction%20to%20explainable%20AI%20with%20Shapley%20values.html" target="_blank">Shapley Values</a></li><li><a href="https://llm-guard.com/output_scanners/gibberish/" target="_blank">Gibberish Detection</a></li></ul>The intro and outro music is from <a href="https://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/Tales_Of_A_Dead_Fish/Hitmans_Lovesong/" target="_blank">Hitman's Lovesong feat. Paola Graziano</a> by <a href="http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/" target="_blank">The Freak Fandango Orchestra</a>/<a href="https://creativecommons.org/licenses/by-sa/3.0/" target="_blank">CC BY-SA 3.0</a>