Chip has worked as a researcher at Netflix, was a core developer at NVIDIA (building NeMo, NVIDIA’s GenAI framework), and co-founded Claypot AI. She also taught Machine Learning at Stanford University.
Q: How would you define AI Engineer or AI Engineering?
A: It's a shift from more machine learning to more engineering and more product. Previously, building machine learning applications required building your own models with your own data and expertise. Now, you can use direct API calls.
Q: What are typical steps to build an AI application, from choosing a model through using RAG all the way to fine-tuning?
A: Start by understanding what makes a 'good' vs. 'bad' response. Build guidelines for the model, add examples, and evaluate using both automated metrics and human evaluations. Augment the context with documents (RAG pattern). Consider starting with keyword retrieval before jumping into embedding-based retrieval. When fine-tuning, be aware of the new problems that you need to deal with.
Q: What are practical ways for software Engineers to get started building AI applications?
A: Start with prompting and simple solutions before moving to advanced techniques. Data preparation can give a huge performance boost. If you have tried a lot of solutions already, and now you try Gen AI and you think this Gen AI is a solution for you, understand what makes a good response, then add examples.
Q: How did you write this book, how were you able to write a book about such a fast moving industry?
A: When you write a topic you're trying to bet on whether it's going to stay relevant in the future.
Focused on fundamentals that have been around for a while.
Determined if a problem is due to fundamental limitations of AI or current capabilities.
Q: Given that AI engineering moves so quickly, how did you ensure your book stays current?
A: Prioritised progress and future relevance over current trends. Focused on the underlying fundamentals rather than fleeting capabilities. Paid attention to whether advancements are due to fundamental limitations or temporary capabilities of AI, and how quickly capabilities are changing.
Q: What are the most common techniques used when building AI applications, things that a software engineer who's going into building AI applications should know about?
A: Techniques like RAG (Retrieval-Augmented Generation) and fine-tuning are useful. Focus on understanding what constitutes a good response. RAG is a powerful pattern, but don't jump straight to vector search; start with keyword retrieval.
Q: Given this rush to use Gen AI, what are your thoughts on staying focused on the problem?
A: The more we want to not miss out on things, the more things we will miss. Pick a problem that you care about and only focus on things that help you solve that problem. If some news comes out, ask if it helps solve the problem; if not, wait.
Q: When building an AI system, you need to evaluate the output. Why is it difficult to evaluate AI systems, and what are common ways to do that?
A: It's challenging because the smarter AI becomes, the harder it is for humans to evaluate. Use functional correctness to evaluate how well it performs a task. Use AI to evaluate other AI. Comparative evaluations can also be helpful. The quality of the AI judge depends on the underlying model and the prompt, and it is so non-deterministic.
Q: In evaluating AI, how do you prioritise users’ needs and experience?
A: Evaluate an application based on what users want. Go talk to users and look at their interactions. Manual data inspection has a high value-to-effort ratio. Measure what matters. Don't forget human evaluations.
Q: What are some common mistakes you've seen when teams are building AI applications?
A: Using Gen AI when you don't need it. Jumping straight to complex solutions or fancy agent frameworks.
Systematic thinking and problem-solving approaches usually don't change, even when technologies do. Don't use GenAI when you don't need GenAI
Q: Speaking of picking up new technology, as someone learning GenAI, a software engineer who wants to get into AI engineering, what would your recommendation be to learn?
A: Balance project-based learning with structured learning. Be able to stop and ask questions, and sometimes structured learning can help you ask the right questions. Pick a project, complement it with structured learning, and read papers. Tutorial based learning is great, but it is very important to be able to stop and ask questions.
Q: Will AI mean the end of software engineering?
A: AI can automate coding, but it won't fully automate problem-solving. Software engineering is about solving problems, not just the physical act of coding.
Q: What other use cases are you excited about for that AI could bring outside of just coding?
A: Education. Entertainment.
Q: What programming language did you use most when you built AI applications or did ML engineering?
A: Python and JavaScript.
Q: Which one is your favorite LLM model right now and why?
A: I don't really have a favorite.
Q: What are one or two books that you've read and would recommend?
This Week I Learned - * navigator.sendBeacon() method is used to quietly send analytics or telemetry data, often when a page is closing. * P4RS3LT0NGV3 is a Universal Text Translator. Think of it as a universal translator for ALL alphabets and writing systems! It is a powerful web-based text transformation and steganography tool with 159 built-in text transforms spanning encodings, classical and modern ciphers, Unicode styles, formatting, and niche alphabets. * When you see that Claude is "organizing its thoughts" during long conversations—it indicates automatic context management is working. * Fable 5 is now the most capable model of Claude and takes 2X the usage of Opus. Fable 5 is the same base model as Mythos but with cybersecurity guardrails. In the same week of its release, Anthropic has disabled Fable 5 and Mythos 5 for all users after a US directive restricting foreign national access, citing national security concerns. * Meta chief product officer Chris Cox o...
Some of the most beloved tracks in pop history are built on lyrics that, on paper, should be complete gibberish. Yet they somehow feel profound, catchy, and emotionally true. It’s remarkably similar to how modern AI strings together convincing words — fluent, rhythmic, and evocative, even when there’s no deeper literal meaning behind them. Two perfect examples are Nik Kershaw’s 1984 synth-pop earworm “The Riddle” and A.R. Rahman’s infectious “Mental Madhilo” (Telugu) / “Mental Manadhil” (Tamil) from the 2015 film OK Bangaram / O Kadhal Kanmani . Both prove that sometimes the less the words mean, the more they resonate. Nik Kershaw has been refreshingly honest for decades: “The Riddle” is deliberate nonsense. He scribbled the cryptic, vivid lines (“Near a tree by a river, there’s a hole in the ground / Where an old man of Aran goes around and around”) as a temporary guide vocal and never replaced them. The band loved the flow, the public turned it into a massive hit, and generations ha...
Comments
Post a Comment