Talking to Notebooks with Jupyter AI

Episode #440, published Thu, Nov 30, 2023, recorded Mon, Oct 30, 2023

Episode Deep Dive Links Transcript

We all know that LLMs and generative AI has been working its way into many products. It's Jupyter's turn to get a really awesome integration. We have David Qiu here to tell us about Jupyter AI. Jupyter AI provides a user-friendly and powerful way to apply generative AI to your notebooks. It lets you choose from many different LLM providers and models to get just the help you're looking for. And it does way more than just a chat pane in the UI. Listen to find out.

Play on YouTube

Watch the live stream version

Episode Deep Dive

Guests

David Qiu is a seasoned software engineer at AWS, specializing in the AI and Machine Learning organization. With nearly two years at AWS, David contributes significantly to Project Jupyter under the guidance of Brian Granger, the co-founder of Project Jupyter. David is the creator of Jupyter AI, an innovative extension that integrates large language models (LLMs) into Jupyter notebooks, enhancing the functionality and user experience for developers and data scientists alike. He has showcased Jupyter AI at notable conferences, including PyData Seattle 2023 and JupyterCon in Paris 2023, demonstrating its capabilities to a wide audience.

What to Know If You're New to Python

To get the most out of the "Talking to Notebooks with Jupyter AI" episode, it's beneficial to have a basic understanding of Jupyter notebooks and familiarity with Python's ecosystem. Familiarize yourself with how Jupyter extensions work and the fundamentals of large language models (LLMs). This knowledge will help you appreciate the integration and functionalities discussed in the episode.

Key Points and Takeaways

Introduction to Jupyter AI Jupyter AI is a powerful extension for Project Jupyter that seamlessly integrates generative AI into your notebooks. It supports multiple LLM providers and models, allowing users to select the best fit for their specific needs. Unlike traditional chat panes, Jupyter AI offers a comprehensive set of tools to enhance productivity and streamline workflows.
- Learn More: Jupyter AI Documentation
Multi-LLM Support and Model Agnosticism One of the standout features of Jupyter AI is its ability to interface with various LLM providers such as OpenAI, Anthropic, and AI21. This model-agnostic approach ensures flexibility and prevents vendor lock-in, enabling users to experiment with different models to find the most suitable one for their tasks.
- Relevant Links: Jupyter AI Documentation
Magic Commands in Jupyter AI Magics Jupyter AI introduces magic commands through its Jupyter AI Magics package, which allows users to interact with AI models directly within the IPython shell. Commands like %AI enable functionalities such as code explanation, refactoring, and more, enhancing the interactivity and intelligence of your notebooks.
- GitHub Repository: Jupyter AI on GitHub
Slash Commands: Learn and Ask The /learn and /ask slash commands empower users to teach Jupyter AI about specific files and query information contextually. By using /learn [file_path], users can instruct Jupyter AI to understand and retain information from particular documents, enabling more informed and relevant responses when using /ask [question].
- Example Usage: /learn documentation.md followed by /ask Explain the function X in the documentation.
Privacy and Security Considerations Jupyter AI prioritizes user privacy and security by ensuring that all interactions with third-party language models are transparent and traceable. Prompts sent to AI models are logged, and data is only transmitted upon explicit user actions. This approach provides users with control over their data and maintains compliance with privacy policies.
- Learn More: Jupyter AI Documentation
Generating New Notebooks Jupyter AI can generate entire notebooks based on natural language prompts. By breaking down the generation process into smaller, manageable tasks, Jupyter AI ensures focused and accurate content creation. This feature is particularly useful for creating tutorial-style notebooks that guide users through specific topics or projects.
- Relevant Links: Generating a New Notebook
Embedding Models and Semantic Search To enhance context understanding, Jupyter AI utilizes embedding models that map syntax to high-dimensional semantic spaces. This allows for semantic search capabilities, enabling the AI to retrieve and reference information based on meaning rather than mere keyword matching. This feature significantly improves the relevance and accuracy of AI-generated responses.
- Tools Mentioned: Langchain, Dask
Interpolating Prompts with Variables Jupyter AI supports prompt interpolation, allowing users to incorporate variables from their notebooks into AI prompts. This feature enables dynamic and context-aware interactions, making it easier to reference and manipulate data within your AI-driven workflows.
- Example Usage: Defining a variable poet = "Walt Whitman" and using it in a prompt like Write a poem in the style of {poet}.
Future Developments and Agent Integration David Qiu discussed the potential integration of agents to further streamline interactions, eliminating the need for slash commands. Although still in the experimental phase, this development aims to make Jupyter AI even more intuitive and user-friendly by allowing natural language prompts without predefined command structures.
Integration with Other Tools: Langchain and Dask Jupyter AI leverages powerful Python libraries like Langchain for building language model applications and Dask for parallel and distributed computing. These integrations enhance Jupyter AI's capabilities, providing robust solutions for complex data science and software development tasks.
- Langchain: GitHub Repository
- Dask: Dask Documentation

Quotes and Stories

David Qiu on Conference Experience: "I had the documentation already available in my home directory... I just had that laptop on the side and told people, like, if you have any questions, try answering that."
On the Power of Slash Commands: "With the fluid back and forth, it's really nice. It makes AI more human, not for any specific persona, just for humans in general."
On Privacy and Traceability: "Jupyter AI is both transparent and traceable. Whenever you use a language model that's hosted by a third party, that's always captured in the server logs by default."

Overall Takeaway

Jupyter AI represents a significant advancement in integrating generative AI within the Jupyter ecosystem, offering versatile tools that cater to both software developers and data scientists. By supporting multiple language models, introducing intuitive magic and slash commands, and prioritizing privacy and security, Jupyter AI enhances the functionality and user experience of Jupyter notebooks. Its seamless integration with powerful libraries like Langchain and Dask further solidifies its position as an indispensable tool for modern Python workflows. Whether you're looking to generate comprehensive tutorials, refactor code effortlessly, or harness the full potential of AI-driven insights, Jupyter AI provides the flexibility and intelligence needed to elevate your projects to new heights.

Links from the show

David Qiu: linkedin.com

Jupyter AI: jupyter-ai.readthedocs.io

Asking about something in your notebook: jupyter-ai.readthedocs.io
Generating a new notebook: jupyter-ai.readthedocs.io
Learning about local data: jupyter-ai.readthedocs.io
Formatting the output: jupyter-ai.readthedocs.io
Interpolating in prompts: jupyter-ai.readthedocs.io
JupyterCon 2023 Talk: youtube.com
PyData Seattle 2023 Talk: youtube.com
Watch this episode on YouTube: youtube.com
Episode #440 deep-dive: talkpython.fm/440
Episode transcripts: talkpython.fm

--- Stay in touch with us ---
Subscribe to Talk Python on YouTube: youtube.com
Talk Python on Bluesky: @talkpython.fm at bsky.app
Talk Python on Mastodon: talkpython
Michael on Bluesky: @mkennedy.codes at bsky.app
Michael on Mastodon: mkennedy
Episode #440 deep-dive: talkpython.fm/440

Episode Transcript

Collapse transcript

WebVTT format On GitHub

00:00 We all know that LLMs and generative AI have been working their way into many products.

00:05 Well, it's Jupyter's turn to get a really awesome integration.

00:08 We have David Kui here to tell us about Jupyter AI.

00:11 Jupyter AI provides a user-friendly and powerful way to apply generative AI to your notebooks.

00:17 It lets you choose from many different LLM providers and models to get just the help that you're looking for.

00:23 And it does way more than just add a chat pane in the UI.

00:27 Listen in to find out.

00:29 This is Talk Python To Me, episode 440, recorded October 30th, 2023.

00:34 Welcome to Talk Python To Me, a weekly podcast on Python.

00:52 This is your host, Michael Kennedy.

00:54 Follow me on Mastodon, where I'm @mkennedy and follow the podcast using @talkpython.

00:59 Both on Bostodon.org.

01:01 Keep up with the show and listen to over seven years of past episodes at talkpython.fm.

01:06 We've started streaming most of our episodes live on YouTube.

01:10 Subscribe to our YouTube channel over at talkpython.fm/youtube to get notified about upcoming shows and be part of that episode.

01:19 This episode is sponsored by Posit Connect from the makers of Shiny.

01:23 Publish, share, and deploy all of your data projects that you're creating using Python.

01:27 Streamlit, Dash, Shiny, Bokeh, FastAPI, Flask, Quattro, Reports, Dashboards, and APIs.

01:34 Posit Connect supports all of them.

01:36 Try Posit Connect for free by going to talkpython.fm/posit.

01:41 P-O-S-I-T.

01:42 And it's also brought to you by us over at Talk Python Training.

01:46 Did you know that we have over 250 hours of Python courses?

01:51 Yeah, that's right.

01:52 Check them out at talkpython.fm/courses.

01:57 David, welcome to Talk Python To Me.

01:59 Awesome to have you here.

02:00 Yeah, thank you, Michael.

02:01 I'm really excited to see what the AIs have to say today.

02:04 The AIs?

02:05 Yeah, language models, sure.

02:06 Yes, exactly, exactly.

02:08 Now, you've built a really cool extension for Jupyter that plugs in large language models for people.

02:15 And it's looking super interesting.

02:17 So I'm excited to talk to you about it.

02:19 Yeah, I'm excited to talk about Jupyter AI, too.

02:21 I've actually presented this twice at one set.

02:24 Actually, three times.

02:26 I did a short demo at this tech meetup thing in Seattle.

02:30 That was actually the first time Jupyter AI was shown to the public.

02:33 And then I presented at PyData Seattle at Microsoft's Redmond campus.

02:38 And then I got to present again at JupyterCon in Paris this May.

02:43 It was a really wonderful experience.

02:45 But yeah.

02:46 Wow.

02:46 Yeah, you're making the rounds.

02:47 Yeah.

02:48 I love to talk about Jupyter AI.

02:50 It happens to get me some plane tickets.

02:52 Just joking.

02:55 Honestly, that's like half the bonus of conferences is like awesome places you get to go.

02:59 The other half probably is the people you meet.

03:02 You know, it's really cool.

03:02 Oh, for me, it's like almost like all the people.

03:05 Like the people are just so great, especially JupyterCon.

03:08 Yeah.

03:08 And the JupyterCon videos are now out for JupyterCon 2023.

03:12 And there's a ton of good looking talks there.

03:14 Yeah.

03:15 Lots of really smart people.

03:16 I mean, like I was chatting to a few folks there.

03:20 And that's like the only place where you're going to find like people who work at these hedge funds and trading firms just lounging so idly and casually.

03:29 Right.

03:29 Like.

03:31 Yeah.

03:32 The market's opening and they're chilling.

03:33 It's all fine.

03:34 Yeah.

03:34 It's a, there's a lot of smart people there.

03:36 Yeah.

03:37 Jupyter more than a lot of programming technologies bring people from all sorts of different places together, different backgrounds.

03:44 There's like a huge, yeah, there's like a lot of reasons behind that.

03:48 But long story short, Jupyter's pretty awesome.

03:51 And that's kind of why I work to contribute to it.

03:53 Awesome.

03:53 Well, let's start this whole conversation with a bit of background about yourself.

03:57 And for people who didn't see your talk and don't know you yet, tell them a bit about you.

04:02 I didn't really give much of an intro there either, but sure.

04:05 Yeah.

04:06 So I've worked for AWS as a software engineer.

04:09 And I've been with AWS, specifically the AI ML organization at AWS.

04:15 I've been with them for almost two years now.

04:17 Right now, I, my manager is actually Brian Granger, who's the co-founder of Project Jupyter.

04:23 He also works for AWS.

04:25 Yeah.

04:25 So he's been offering some technical and product guidance for the things that we're building.

04:31 And he's a fantastic gentleman to work with.

04:35 Cool.

04:35 Yeah.

04:35 It's a really neat, neat to have him available as a resource, you know, as a colleague.

04:40 Yeah.

04:40 You know, it's funny.

04:41 Like I actually, yeah, I met him internally.

04:44 So when I first joined, I wasn't working for him, but at tech companies, you can do this internal

04:49 transfer thing.

04:50 And basically my old team was kind of the team I just joined right after I joined, it sort

04:56 of started to dissolve a little because they just launched a product at reInvent, which happens

05:01 in like December.

05:03 And then, so I joined in December and it's like, oh, hi.

05:07 So then I, yeah.

05:09 And then I joined, I messaged, I saw just, saw Brian Granger's name somehow and I messaged

05:14 him and I didn't even know that he was the co-founder of Project Jupyter.

05:18 I just wanted to work for him because I used it before.

05:21 And yeah, it's a pretty funny story.

05:23 Indeed.

05:24 I imagine that project is, is really, you know, this Jupyter AI is a great example, but

05:29 I just thinking of being say a founder of Jupyter or, or something like that.

05:34 And these things take on a life of their own.

05:36 And he's probably in awe of all the stuff happening and all the things going on.

05:41 And there's probably a lot of stuff in Jupyter.

05:43 He doesn't even know about, right?

05:44 It's like, that's happening.

05:45 Yeah, it's huge.

05:46 And yeah.

05:47 And like the leadership structure has changed to accommodate that.

05:50 So Brian is no longer the benevolent dictator for life.

05:54 Jupyter Project Jupyter is now governed by a committee decentralized and democratized just

05:59 to allow it to scale.

06:00 Yeah, of course.

06:01 Let's start by talking about a bit of the role of AI and data science.

06:07 I don't know how you feel about it.

06:09 Obviously you must be somewhat of an advocate putting this much time and energy into bringing

06:14 it to Jupyter.

06:14 Wow.

06:15 However, personally, when I want to know something, I don't think there's a great specific

06:22 search result for it straight to ChatGPT or friends.

06:27 I think there's, there's such a, such a wealth of information there from, I need to take

06:31 this paragraph and clean it up and make it sound better to, I have this program.

06:35 I want to convert to another language, or I have this data in this website.

06:41 How do I get it?

06:42 You know, like just, you can ask so many open-ended questions and really get great answers.

06:47 So it seems to me, especially coming from, like I mentioned before, those diverse backgrounds,

06:51 people not necessarily being like super deep in programming, maybe they're deep in finance,

06:56 but they do programming that having this AI capability to ask like, Hey, I know I

07:01 can, but how, what do you think for data science in particular?

07:05 This is an interesting topic, right?

07:09 Because I think the whole power of language models stems from their ubiquity and versatility

07:13 and how they can sort of be very generally applicable.

07:16 So like the thing about language models is that they're basically statistical models that have

07:22 been trained on a very, very, very large corpus of data.

07:25 And that's really to the computer.

07:28 The computer doesn't really understand English.

07:31 It doesn't really understand natural language.

07:32 It has, it's basically making, it has like, when it's trained, it has like knowledge of the

07:40 distribution of information and how information sort of interacts with other information.

07:45 Because of that, it's a very, it has a very general applicability, right?

07:49 And I don't think that's utilities limited to data science.

07:53 Now, if we're talking about the field of data science specifically, I think language models

07:58 have extraordinary utility and explanatory natural language tests, which I think everybody is

08:05 aware of now, now that ChatGPT has been out for almost a year.

08:08 But I think in the field of data science and other like deep technical fields, they're especially

08:14 applicable because of how complicated some of the work is.

08:18 ChatAI can also help analyze and debug code, which JupyterLab also allows you to do.

08:24 I know it's statistics, but when you look at it, it seems like it understands, right?

08:28 It seems like it understands my question.

08:31 And I think one of the really interesting parts is the fact that these LLMs have a context.

08:37 I would like you to write a program in Python.

08:39 Okay, great.

08:40 Tell it to me.

08:41 I want a program that does X, Y, Z, and then it writes it in Python, which sounds so simple.

08:46 But up until then, things like Siri and all the other voice assistants, they seemed so

08:53 disjointed and so not understanding.

08:55 You're like, I just asked you about the weather.

08:57 And when I ask you how hot it is, you know, like, how do you not understand that that applies

09:01 to the weather, right?

09:02 There's just in the fact that you converse with them over a series of interactions is pretty

09:08 special.

09:08 The context.

09:09 Yeah.

09:10 It's basically implemented just by passing the history, the whole history appended to your

09:15 prompt.

09:16 So, yeah.

09:16 It's not like any super staple magic or whatever, but it's still very interesting, like, how

09:23 far you can take it.

09:24 And yeah, definitely, like, the context allows you to interact with the AI a lot more conversationally

09:32 and humanly.

09:33 Like, you don't have to pretend like you're talking to an AI.

09:36 You can actually just kind of treat it like a human and it still answers questions very well.

09:41 There was even at Google, there was that engineer who said they thought it had become sentient.

09:46 And there was that whole drama around that, right?

09:49 This is such a crazy coincidence.

09:50 But my roommate is actually friends with that gentleman.

09:54 Oh, really?

09:55 I know.

09:55 Wow.

09:56 Absolutely crazy coincidence.

09:58 Like, it's just fun.

09:59 I just thought it was really funny you bringing it up.

10:01 It was that, it was a Cajun gentleman, a senior engineer at Google, right?

10:05 Yeah.

10:06 Very funny, Mike.

10:07 It's been a little while.

10:08 I don't remember all the details, but yeah, I mean, it's pretty wild and pretty powerful.

10:12 I think I've recently read, I'm trying to quick look it up, but I didn't find it.

10:16 I think they just used LLMs to discover, like, a new protein folding.

10:21 And it's that kind of stuff that makes me think, like, okay, how interesting that it's, you know,

10:26 that knowledge wasn't out there in the world, necessarily.

10:29 I have a lot to say on that subject.

10:31 So personally, I don't believe that language models are actually intelligent.

10:35 I think that people are conflating, well, they're certainly not conscious, right?

10:41 Absolutely.

10:41 Yeah.

10:41 As to whether they're intelligent, I don't think they are.

10:44 I think that intelligence has a, like, intelligence as we know it, as humans know it, has some

10:50 different characteristics that language models don't really exhibit.

10:53 They're best thought of as, like, really, really, really good statistical models.

10:57 Like, are you familiar with the mirror test?

11:00 Maybe, but I don't think so.

11:01 Yeah.

11:01 So it's, like, this idea in animal psychology, but, like, if a cat sees a mirror, it thinks

11:07 it's another cat because it doesn't recognize its own inflection.

11:10 Right.

11:10 You see them all get, like, big and, like, trying to, like, act big and tough to chase it off,

11:15 and it's just them, yeah?

11:16 Language models are kind of like that, but for humans, right?

11:19 Like, if something mimics human-like qualia closely enough, very tempting to think of it

11:24 as human.

11:25 Yeah.

11:25 We see faces on Mars when it's really just erosion, stuff like that.

11:29 Exactly.

11:30 I could talk about this, like, for a full hour, but yeah, we should totally move on to

11:34 another topic before I go on a tirade.

11:36 Well, when you're saying it's not that intelligent, I'm wondering if maybe you've misnamed this project.

11:42 Maybe it should just be Jupyter A, like the I?

11:45 Do you got to drop the I?

11:46 I don't know.

11:47 We're just following convention, right?

11:48 So I still use the term AI.

11:50 Of course.

11:51 You got to talk to people.

11:52 Yeah.

11:52 Emphasize the artificial, huh?

11:54 Before we get to Jupyter AI, what came before?

11:57 How did people work with things like ChatGPT and other LLMs in Jupyter before stuff like Jupyter AI came along?

12:03 I think initially it was a combination.

12:06 So the initial motivation for this project came through a combination of sort of a demo put together by Fernando Perez, who is another, I believe.

12:16 I think he's another co-founder.

12:17 another co-founder of Project Jupyter, and he put together this demo called Jupyter, which is, it's like spelled like Jupyter, except the last letter is an E.

12:27 And it's a pun of like ChatGPT, right?

12:30 Jupyter.

12:31 There was a combination of that demo project set by Fernando and some motivation from my manager, Brian, who also was, you know, as a leader in the AWS AI organization.

12:46 You know, he's always trying to think of fancy, fancy new ideas, right?

12:51 And this is a pretty fun idea to work out.

12:54 So I put together, I think this was sometime in early January, I put together the first demo.

13:00 It was private.

13:01 And I showed it off to the team and they were like, wow, this has a lot of potential.

13:05 Let's see if we can grow it a bit more.

13:07 And then as we worked on it for the next few months, it became clear like, oh, wow, this is actually really significant.

13:15 Let's keep working on this.

13:16 So it's definitely been a collaborative effort to bring Jupyter AI to where it is today.

13:23 Sounds like it.

13:24 Definitely a lot of contributors over on the GitHub listing.

13:26 Let's get into it.

13:27 What is Jupyter AI?

13:28 People can guess, but it's also different in ways than maybe just plug it in a chat window.

13:34 Jupyter AI is actually, right now, it's two packages.

13:38 But it's best thought of as just a set of packages that bring generative AI to Project Jupyter as a whole.

13:44 So not just Jupyter Lab, but also Jupyter Notebook and IPython.

13:48 Even the shell.

13:49 How do you, I guess you invoke it by doing like the magic there as well.

13:53 It's the IPython shell, which is not the same as like a bash shell for instance, your terminal.

13:58 Interesting.

13:59 You could dive into a little bit more detail on what these two packages are.

14:03 So we have the base Jupyter AI package, which is spelled exactly as you might imagine it.

14:09 It's Jupyter hyphen AI.

14:11 That is a Jupyter Lab extension that brings a UI to Jupyter Lab, which is the screenshot that you're showing on your screen.

14:19 But for viewers without a screen, it basically is the package that adds that chat panel to the left-hand side and allows you to speak conversationally with an AI.

14:30 And then the second package is Jupyter AI magics, which is spelled the same, except at the end, it's spelled hyphen magics.

14:38 And that is actually the base library that implements some of the AI providers we use and brings things called magic commands to the IPython shell.

14:48 And magic commands basically let you invoke the library code, aka like calling link, using it to like call language models, for instance.

14:56 And that allows you to do it inside an IPython context.

15:00 So what's crazy is that if you run IPython in your terminal shell, you can actually run Jupyter AI from your terminal, which is pretty cool.

15:09 Yeah, I didn't realize that.

15:10 I mean, it makes sense, of course, but I hadn't really thought about it.

15:14 Yeah, I thought more of this like kind of a GUI UI type of thing that was alongside what you were doing.

15:18 Yeah.

15:19 We try to make it flexible.

15:20 And there's reasons for the magic commands, which I can talk about that later, though.

15:24 This portion of Talk Python To Me is brought to you by Posit, the makers of Shiny, formerly RStudio, and especially Shiny for Python.

15:34 Let me ask you a question.

15:36 Are you building awesome things?

15:38 Of course you are.

15:39 You're a developer or a data scientist.

15:40 That's what we do.

15:41 And you should check out Posit Connect.

15:44 Posit Connect is a way for you to publish, share, and deploy all the data products that you're building using Python.

15:50 People ask me the same question all the time.

15:54 Michael, I have some cool data science project or notebook that I built.

15:57 How do I share it with my users, stakeholders, teammates?

16:00 Do I need to learn FastAPI or Flask or maybe Vue or React.js?

16:05 Hold on now.

16:06 Those are cool technologies, and I'm sure you'd benefit from them.

16:09 But maybe stay focused on the data project?

16:11 Let Posit Connect handle that side of things?

16:14 With Posit Connect, you can rapidly and securely deploy the things you build in Python.

16:18 Streamlit, Dash, Shiny, Bokeh, FastAPI, Flask, Quattro, Reports, Dashboards, and APIs.

16:24 Posit Connect supports all of them.

16:27 And Posit Connect comes with all the bells and whistles to satisfy IT and other enterprise requirements.

16:33 Make deployment the easiest step in your workflow with Posit Connect.

16:37 For a limited time, you can try Posit Connect for free for three months by going to talkpython.fm.

16:43 That's talkpython.fm/P-O-S-I-T.

16:47 The link is in your podcast player show notes.

16:49 Thank you to the team at Posit for supporting Talk Python.

16:54 Now, one thing, you said this, but I want to emphasize it a little bit in that this will run anywhere IPython kernel runs, which is JupyterLab notebook, but also Google Colab, VS Code, other places as well, right?

17:09 So pretty much it comes to you wherever your Jupyter type of stuff is, right?

17:13 Yeah, and the same goes for your lab extensions.

17:15 So the great thing about lab extensions is that they work anywhere where the product is just sort of built on top of JupyterLab, right?

17:23 So Google Colab is essentially what Google has is a, well, I can't obvious, I obviously can't attest to what they're actually doing.

17:31 But most likely, it's like a set of extensions or CSS themes that are built on top of JupyterLab.

17:37 But like the underlying code is still JupyterLab.

17:39 It's still mostly JupyterLab.

17:41 So you can actually just install extensions and they work just fine, which is another reason why JupyterLab is just pretty awesome.

17:49 Yeah, it sure is.

17:51 Yeah, JupyterLab itself is basically a pre-selected, pre-configured set of extensions, right?

17:57 That's pretty cool.

17:57 That is true, yeah.

17:58 Giving preference to or showing just what I play with mostly, which is ChatGPT.

18:04 There's actually a lot of language models that you can work with, right?

18:07 One of the big things, and this is something I'll circle back to later, is that Jupyter AI is meant to be model agnostic, meaning that we don't discriminate against the choice of model or model provider.

18:18 Because as an open source project, it's very imperative that we maintain the trust of our users, right?

18:24 Users have to be sure that this isn't just some product that exists to force them and force or pigeonhole them into using a certain model provider like OpenAI or Anthropic.

18:36 It's the product makes no opinions.

18:38 We simply try to support everything as best as we can.

18:41 And we've written a lot of code to make sure that all of these models and just play nicely together, essentially.

18:49 Like every model provider, like let's say from Anthropic or AI21 or Cohere, every one of these APIs kind of has its own quirks.

19:00 Every one of these Python SDKs has its own quirks.

19:02 And we work very hard to basically iron out the surface and make everything have the same interface.

19:09 We can talk about that later, though.

19:11 Sure.

19:11 And it also makes it pretty easy to try them out, right?

19:13 If you switch from one to the other.

19:15 You're like, I wonder how Hugging Face versus OpenAI would do to solve this problem, right?

19:21 Absolutely.

19:21 Like that's kind of one of the ideas is like while, you know, certain model providers, they might offer a UI if they're very well funded by their investors.

19:32 For example, OpenAI to have a UI.

19:34 But that UI only allows you to compare between different models from OpenAI, which, you know, like as an independent third party looking to use an AI service, like that information is obviously a little biased, right?

19:49 Like you want to see like what other providers have to offer.

19:52 Like what does AI21 have?

19:53 What does Anthropic have?

19:55 And right now it's really, there really is no like cross model provider like UI or like interface in general.

20:04 But like that's kind of one of the use cases that Jupyter AI was intended to fit.

20:09 Yeah.

20:09 Provides a standard way to interact with all these and sort of compare them.

20:13 And it's also a UI for them.

20:14 And if you're using JupyterLab, yeah.

20:16 I'm not familiar with all of these different models and companies.

20:20 Do any of those run locally, like things like GPT for all, where it's a local model versus some kind of cloud?

20:27 Where's your key?

20:28 Where's your billing details and all that?

20:30 We actually recently just merged a PR that adds a GPT for all support.

20:35 Okay.

20:36 That's included in the release.

20:37 However, back when we first implemented this a few months ago, I had a few issues with the platform compatibility.

20:44 So like it wouldn't, so like some of the binaries that we downloaded from GP for all didn't seem to work well on my M1 Mac, for instance.

20:53 I'd say, yes, we do have local model support, but it's a bit experimental right now.

20:58 We're still like ironing out the edges and testing, like seeing how we can make the experience better.

21:03 Like does it sometimes only get bad output because we forgot to install the shared library?

21:08 Those are the type of questions that our team is wrangling with right now.

21:12 I see. And maybe I just jumped right into it, but maybe tell people what GPT for all is just real quickly.

21:17 GPT for all offers a few local models.

21:20 Actually, they offer several, I believe, not just a few.

21:23 I think they offer maybe like 10 to 15.

21:26 It's the numbers getting quite large.

21:28 Yeah.

21:28 To the point where I don't know what the right choice is.

21:30 I'm like, which one do I download?

21:32 There's they say they're all good.

21:33 Of course, they're going to say they're good.

21:34 I'll be frank.

21:35 I don't actually have that much experience with GPT for all.

21:38 It's we mainly use them as sort of a provider for like these free and open source language models.

21:44 Yeah.

21:44 I think they offer UI as well for multiple platforms.

21:48 I've only played with them a little bit, just started checking it out.

21:50 But it's basically a local.

21:52 You download the model.

21:54 You run it locally.

21:54 You don't pay anything because it's just just running on your machine.

21:58 As opposed to, say, OpenAI and others where you've at least got rate limiting and a certain amount of queries before you have to pay and maybe potentially access to better models like ChatGPT-4 versus 3.5 and so on.

22:12 That's also taking the characteristics of these service providers for granted, right?

22:18 So, yes, definitely, while it does hurt the wallet to pay for usage credits, right?

22:26 It's also pretty remarkable how small the latency has gotten with some of these APIs.

22:31 I've gotten sub-500 millisecond latency on some of these APIs.

22:35 And that's really incredible because when I was using GPT-4-All, the latency was a little bit high on running locally with limited compute resources.

22:44 It's really remarkable how fast these APIs are.

22:48 It is pretty insane.

22:49 Sometimes it drives me crazy.

22:51 I'm just only, again, referring to ChatGPT because I don't have the experience with the others to the degree.

22:56 But it drives me crazy how it artificially limits the response based on the speed of the response so it looks like it's chatting with you.

23:04 I'm like, no, I have four pages of stuff because you just get it out.

23:08 You know, it'll say something like, you can ask, I gave you a five-page program.

23:14 Let's call it X.

23:15 If you say, what is X?

23:17 It'll just start printing it slowly, line by line.

23:19 Like, you know you just are echoing it back.

23:21 Just get it out.

23:22 I want to ask you the next question.

23:24 You know what I mean?

23:25 In that case, that's actually a feature request that we've gotten because it doesn't actually slow down.

23:31 Like, it's not just like a pointless animation.

23:33 Yeah, the servers are streaming essentially token by token, right?

23:37 As the language model generates output.

23:39 So it's kind of more like a progress indicator than a superfluous animation.

23:44 Yeah.

23:44 Yeah, of course.

23:45 But if you've got something large, large blocks of text you're working with, it can be a drag.

23:50 All right.

23:51 I wanted to kind of touch on some of the different features that I pulled out that I thought were cool.

23:56 I mean, obviously it goes without saying that Jupyter AI is on GitHub.

24:01 I mean, because it's software.

24:03 And so it's open source, which I don't know if we said that, but obviously free open source on GitHub.

24:09 BSD3 license.

24:11 But it's also noteworthy that it's officially under the JupyterLab organization, not under the David account.

24:19 You know what I mean?

24:20 It's officially part of the JupyterLab subproject.

24:23 And yep, as you pointed out, we're under the JupyterLab GitHub org as well.

24:27 Yeah, that's awesome.

24:28 Let's talk about some of the different things you can do with it.

24:32 Some of them will be straightforward, like just like, how do I write a function, Jupyter AI?

24:36 And others, I think, are going to be a little more interesting.

24:40 So let's start with asking something about your notebook.

24:43 Tell us what people can do here.

24:45 Asking about your notebook basically means like you can actually teach Jupyter AI about certain files, right?

24:51 So the way you do this is via a slash command in the chat UI.

24:55 So you type slash learn and then file path.

24:58 And it essentially teaches the Jupyter AI about that file.

25:03 Now, it works best with files that are written in natural language, right?

25:08 So like text files or markup or markdown rather.

25:11 Yeah.

25:12 So like those, like, especially like developer documentation as well, right?

25:16 It works really well with those.

25:18 It works best with those kinds of files.

25:20 And after Jupyter AI learns about these files, you can then ask questions about the files.

25:27 It's learned by prefixing your question with the slash ask command.

25:32 That is so cool.

25:33 It is pretty cool.

25:34 I know it's so cool because what I've done a lot of times, if I want ChatGPT to help me, it's like, I'm like, all right, well, let me copy some code.

25:41 Right.

25:42 Then I'm going to have a conversation about it.

25:44 But a lot of the context of, well, it's actually referencing this other function.

25:48 And what does that do?

25:49 Or just a broader understanding of what am I actually working on is missing, right?

25:55 Because I've only copied it.

25:56 You can't paste, you know, 20 files into ChatGPT and start talking about it.

26:01 But with this, you can, right?

26:02 You can say, learn about, you can say, learn about different things, right?

26:06 You can say, learn about your notebook, but you can also probably tell it like, learn about my documentation or learn about my data set.

26:13 And now let me talk to you about it.

26:14 What's interesting is that the right now, while it works best for natural language documents, we are working on improving the experience for code.

26:23 From our testing, like the code is mostly the capabilities of Jupyter AI after learning code is right now mostly limited to explaining what code does, but sort of like explains it from the doc screen.

26:36 So we're working on a way to format the code in a manner that is more interpretable to a language model.

26:43 Sure.

26:44 We're working on ways to improve the experience for code.

26:47 But yeah, definitely the long-term vision is to have Jupyter AI literally be able to learn from the whole directory of files or possibly even a URL, like a remote URL to like, like a documentation page.

27:01 Yeah.

27:01 We have some big ideas there.

27:02 We're still working on them.

27:03 I want to work with a new package XYZ.

27:06 Like, I don't know what XYZ is.

27:07 You know what?

27:08 Here's where you can learn about it.

27:10 Go and figure it out.

27:11 Like, query the PyPI API, get the docs page from the metadata, and then go to that URL, scrape it.

27:18 Like, lots of things we're exploring.

27:20 It's still kind of early days for this, right?

27:22 You've been at it about a year or so?

27:23 It's been out for a while.

27:25 Recently, I've had to work on a few other things as well, like Jupyter AI.

27:29 Unfortunately, I cannot give my entire life to Jupyter AI.

27:33 So I've been working on a few other things these past few months.

27:36 But yeah, there are a lot of things that I envision for Jupyter AI.

27:41 I have a much bigger vision for what I want this project to be and what it can be capable of.

27:46 Exciting.

27:47 So this screenshot that you got here in the section that I'll link to in the show notes

27:52 is cool because you can select a piece of a portion, not even a whole cell,

27:56 but a portion of code in a cell.

27:59 And then you can ask, what does this code do?

28:01 We have an integration with the JupyterLab editor APIs.

28:05 So you can select a block of code and then include that in your prompt.

28:10 And it will be appended to your prompt below, right?

28:13 Right.

28:13 It's appended to Bob, sorry.

28:14 You can basically ask, like, so you can select a block of code.

28:17 So in this screenshot right here, there's this block of code that computes the least common

28:23 multiple of two integers, right?

28:25 And you can select that and then click Include Selection and then ask Jupyter AI,

28:30 what does this do?

28:31 Which is pretty awesome.

28:33 Another checkbox is Replace Selection.

28:35 I'm guessing that is like, help me rewrite this code to be more efficient or if there's

28:40 any bugs, fix it.

28:41 So the Replace Selection checkbox is totally independent.

28:44 So you can actually use both at the same time.

28:47 And one of the use cases for this is refactoring.

28:50 And I've actually applied this in practice a few times where you can basically select a

28:55 block of code and then click both Include and Replace Selection.

28:59 And then you can format your prompt to say, refactor this block of code.

29:03 Do not include any additional help or text.

29:06 And when you send that prompt over, it will actually refactor the code for you in your

29:11 notebook, which is, yeah, like is, it's pretty great.

29:15 That's pretty awesome.

29:16 You know, you could do things like refactor this to use guarding clauses.

29:20 So it's less nested and less, less arrow code or whatever, right?

29:24 Yeah.

29:25 Or like add a doc string, right?

29:27 Summarize this purpose of this function and then enclose that in the doc string and add it

29:32 to the function.

29:33 Right.

29:33 Or this code is Panda's code, but I'd like to use Polars.

29:36 Please rewrite it for Polars, which is not a super compatible API.

29:39 It's not like DAST to Panda's, where it's basically the same.

29:42 Yeah.

29:42 And this kind of circles back to that question that you had asked earlier.

29:45 I think I went on a tangent there and didn't fully answer.

29:48 But like, what is like the utility of Jupyter AI to like data practitioners, right?

29:53 So we're talking data scientists, machine learning engineers, like this, the Include Selection

29:57 features.

29:58 We've heard great feedback about how helpful it is to like actually explain a data.

30:03 So sometimes like you're working with a test set and it's not immediately clear what the

30:09 features of this test set are or like what this even does, because sometimes it's like

30:13 high dimensional data and they can literally select it and then click Include Selection

30:17 and say, and tell Jupyter AI, explain to me what this does.

30:21 Just like, what is this like data frame stuff?

30:24 Like, whoa, we got data frames in data frames.

30:25 Like what's going on here?

30:27 Like what even is the structure?

30:28 That's awesome.

30:29 And I think it's super valuable.

30:30 I think the, and this is like a little bit I was getting to before one of the features

30:33 that I think is cool.

30:34 Whereas if you just go with straight ChatGPT, you copy your code, you paste it into the chat.

30:40 Hopefully it doesn't say it's too much text and then you can talk about it.

30:43 But then when you get an answer, you've got to grab it, move it back over.

30:46 And this just, this fluid back and forth is really nice.

30:50 Yeah.

30:50 And that's actually one of the design principles that we worked out when first starting this

30:56 project officially is the idea that Jupyter AI should be human centered.

31:00 As in, you shouldn't be expected to be a developer to know how to use this tool.

31:05 Like this tool is for humans, not, not for any specific persona, just for humans in general.

31:10 That's awesome.

31:11 Yeah.

31:11 So in this case, you select the function that does the lowest common denominator bit and

31:16 you ask it what it does, it says the code will print out the least common multiple of

31:21 two numbers passed to it.

31:22 Super simple, very concise.

31:24 Okay, great.

31:25 Now we can go on to the next thing, right?

31:27 Yeah.

31:28 There's this LCD function that we're kind of talking about here.

31:32 This example is recursive, which I think recursion is pretty insane, right?

31:39 As just a concept for people to get their head around.

31:42 This is the iterative version.

31:43 So this is after they, yeah, this is the iterative.

31:46 Oh, this is after.

31:46 Yeah.

31:46 So if we go back up.

31:48 Oh, one of the things you ask it is things like an example is rewrite this function to

31:52 be iterative, not recursive.

31:54 Right.

31:54 Which is actually really, really awesome.

31:56 Right.

31:57 You're like, this is breaking my brain.

31:58 Let's, let's see if we can not do that anymore.

32:01 This portion of talk Python to me is brought to you by us.

32:06 Have you heard that Python is not good for concurrent programming problems?

32:11 Whoever told you that is living in the past because it's prime time for Python's asynchronous features.

32:16 With the widespread adoption of async methods and the async and await keywords, Python's ecosystem

32:22 has a ton of new and exciting frameworks based on async and await.

32:26 That's why we created a course for anyone who wants to learn all of Python's async capabilities,

32:31 async techniques and examples in Python.

32:33 Just visit talkpython.fm/async and watch the intro video to see if this course is for you.

32:39 It's only $49 and you own it forever.

32:42 No subscriptions.

32:43 And there are discounts for teams as well.

32:48 Another thing I wanted to talk about, and you talked a fair amount about this in your presentations that you did.

32:54 I can't remember if it was the JupyterCon or the PyData one that I saw, but one of those two.

32:59 You talked about generating new notebooks and how it's, it's actually quite a tricky process.

33:06 You got to break it down into little steps because if you ask too much from the AI, it kind of doesn't give you a lot of great answers.

33:12 Tell us about making new notebooks.

33:13 Like why would I even use, like I can go to JupyterLab and say file new, it'll make that for me, right?

33:18 So what's this about?

33:19 The generate capability is great because it generates a file that is essentially a tutorial,

33:25 that can be used as a tutorial to teach you about new subjects, right?

33:28 So like you could, for example, submit a prompt, like slash generate a notebook about asyncio

33:34 or a demonstration of how to use Matplotlib.

33:38 And after, so this will take a bit of time, but eventually Jupyter AI is done thinking and generates a file

33:45 and it names a file up and generates a notebook.

33:48 And the notebook has a name, it has a title, it has like sections, table of contents,

33:53 and each of the cells within it, like is tied to some like topic that is determined to be helpful

33:59 and to answer the user's question.

34:02 Awesome.

34:03 Could I do something like, I have weather data in this format from the US weather service.

34:10 Could you generate me a notebook to plot this XYZ and help me answer these questions?

34:15 Or like, could I ask it something like that?

34:17 Not at the moment.

34:18 So like that would best be done.

34:20 That would be done best if, since the data is already, like, I'm assuming that the data is already

34:26 available, right?

34:27 In some kind of like format.

34:29 So like in a notebook, you could use a chat UI to like select that entire, select that selection

34:34 and then tell it, tell Jupyter AI to generate code to plot that data set.

34:40 So right now, generate only takes a natural language prompt as its only argument.

34:45 I see.

34:46 So it's kind of like stateless in that regard.

34:48 So in this case, you can say slash generate a demonstration of how to use matplotlib.

34:53 And then the response is great.

34:54 I'll start working on your notebook.

34:56 It'll take a few minutes, but I'll reply when it's ready.

34:58 In the meantime, let's keep talking.

35:00 So what happens behind the scenes that takes a few minutes here?

35:05 This is a bit interesting.

35:06 It does kind of dive deep into like the technical details, which I'm not sure if, which do you

35:12 want to just like dive?

35:13 Yeah, let's go tell us how it works.

35:14 Probably a good chance to explore that.

35:16 Yeah.

35:16 So slash generate, the first thing is that the prompt is first expanded into essentially

35:22 a table of contents.

35:23 So basically we tell the language model, generate us a table of contents conforming to this JSON

35:29 schema.

35:30 And when you pass a JSON schema included in your prompt, the language model will be much

35:36 more, will have a much higher likelihood of returning exclusively a JSON object that matches

35:42 that JSON schema that you had initially provided.

35:45 So in our case, we generate a table of contents.

35:49 And then we take that table of contents.

35:51 And then we say for each section, we do this in parallel, like generate us some code cells

35:56 that are appropriate for teaching this section of the document.

36:00 So for example, for matplotlib, right?

36:03 Like maybe the first section is your first, like generating your first plot, plotting 3D functions.

36:08 And the next one is like plotting complex functions with phase or something like that.

36:13 And then with each of these sections, we then send another prompt template to the language

36:18 model for each of these sections, asking it to generate the code.

36:21 And then at the end, we join it all together and then we save it to disk and then emit that

36:26 message and say, we're done.

36:27 Maybe the English literature equivalent would be instead of just saying, write me a story about

36:33 a person who goes on an adventure and gets lost.

36:35 It's like, I want, give me an outline, bullet points of interesting things that would make

36:41 up a story of how somebody goes on an adventure and gets lost.

36:44 And then for each one of those, you're like, now tell this part of the story.

36:47 Now tell this part.

36:47 And somehow that makes it more focused and accurate, right?

36:51 The main limitation is that because we're a model agnostic, language models are limited

36:56 in how much output they can generate, right?

36:59 Yeah.

36:59 The issue we were running into when we were trying to do the whole thing all at once, like generate

37:04 mean the whole notebook is that some language models just couldn't do it.

37:07 In an effort to, you know, sort of stay model agnostic, we deliberately implemented, we deliberately

37:12 broke this process down into like smaller subtasks, each with its own like prompt template in order

37:18 to accommodate these models that may lack the same token size windows that other models have.

37:26 I think just even for ones that have large token spaces, I think they still, the more specific

37:32 you can be, the more likely you're going to get a focused result instead of a wandering vague result.

37:38 Teach me about math or teach me how to factor, you know, how to integrate this differential,

37:44 solve this series of differential equations or physics.

37:47 Like you're going to get a really different answer to those two questions.

37:50 Yeah.

37:51 That's also circles back to like a topic I didn't, I did want to call out, but I don't, I don't think

37:56 we hit on it.

37:56 Is that the chat UI actually does support rendering in both Markdown and LaTeX, which is a markup

38:03 language for math.

38:04 So you can ask it both complex engineering and mathematical questions, like asking it to explain

38:10 you like, yeah.

38:11 So like there might be a demo here.

38:13 I'm not sure if it's on this page though.

38:16 So if I had a Fourier, fast Fourier transform in LaTeX and I put it in there and say, what

38:22 is this?

38:22 It'll say it's a fast Fourier transform or something like that.

38:25 Yes.

38:25 And it also works the other way around.

38:27 You can also use it to say like, Hey, explain to me what the 2D Laplace equation is, or explain

38:33 to me like, what does this do?

38:35 Right.

38:35 And it will actually generate and format the equation inside the chat UI, which is really

38:42 remarkable.

38:43 I love that feature.

38:44 It's actually really awesome.

38:45 And it's also really appropriate for a scientific oriented thing like Jupyter, right?

38:50 The remarkable thing is that because chat UBT and like other such language models, like the

38:56 ones from Anthropik and AI21, because like they are founded on the premise of where like their

39:02 functionality comes from having such a large corpus of data.

39:05 They know a remarkable amount of information.

39:07 So like we've tried like some example notebooks of quantum computing and explained those really

39:13 well.

39:13 We try, I tried one of like the Black Scholes options pricing model used in financial engineering.

39:19 It's really remarkable.

39:21 Like the utility that it offers just by being there in the side panel.

39:25 Like you essentially have like a math wizard available to you and to your lab all the time.

39:30 It's probably better than a lot of math professors in terms of not necessarily in the depth of

39:35 one area.

39:36 But, you know, if you ask somebody who does like abstract algebra about real analysis, they're

39:42 like, I don't really do that part.

39:43 Or if you ask somebody about real analysis about number theory, like I don't really, you can

39:46 hit on all the areas, at least a generalist professor sort of thing.

39:50 We talked about the slash learn command.

39:53 That's pretty excellent already and where that's going.

39:56 So I'm pretty excited about that.

39:58 Yeah.

39:58 It actually does have a lot of interesting technical tidbits to it.

40:02 Like the implementation.

40:03 Okay.

40:04 Yeah.

40:04 Actually, this is one of the really challenging things with these chat bots and things.

40:09 For example, I've tried to ask ChatGPT if I gave it one, just one of the transcripts from

40:14 the show.

40:15 I want to have a conversation about it.

40:16 It says, ah, that's too much.

40:17 I can't do it.

40:18 You know, it's just, it's just one, one show.

40:21 And in doc, like in your documentation, there might be a lot of files in there, right?

40:25 More than just one transcript levels worth.

40:28 So that alone, I think it's kind of interesting just how to ingest that much data into it.

40:32 Yeah.

40:32 And you know, this is a very interesting subject and it actually is a bit complex.

40:38 I'm sure it is.

40:39 I think there are some other features you want to discuss.

40:41 Let's dive into this for just a minute because I think it is interesting.

40:44 How do you make, because this makes it yours, right?

40:46 It's one thing to ask vaguely, like, tell me about the Laplace equation and how does it

40:50 apply to heat transfer?

40:51 Like, okay, great.

40:52 I have a specific problem with a specific library and I want to solve it.

40:56 And you don't seem to understand about enough of it.

40:58 So it really limits the usefulness if it doesn't, if it's not a little closer to what you're

41:03 actually doing.

41:03 And I think this brings it closer.

41:04 So yeah, tell us about it.

41:06 Language models aren't just governed by like their intelligence, however you measure that,

41:11 right?

41:11 They're also governed by how much context they can take.

41:14 So one of the reasons ChatGPT was so remarkable is that it had a great way of managing context

41:19 through conversation history.

41:20 And that seemingly small leap and seemingly small feature is what made ChatGPT so remarkably

41:30 disruptive to this industry is because of that additional context.

41:34 And we think about extending that idea.

41:37 How do we give an AI more context, make it even more human-like and personal?

41:41 Well, the idea is similar.

41:44 We add more context and that's what learning does, right?

41:47 And so the way learning works is that we're actually using another set of models called

41:52 embedding models.

41:53 And embedding models are very, very underrated in the AI-like modeling space, right?

42:00 These are really remarkable things.

42:02 And they have this one, I'll only cover like the most important characteristic of embedding

42:07 models, which is embedding models take syntax and map it to a high-dimensional vector space

42:15 called a semantic space.

42:16 And inside of the semantic space, nearby vectors indicate semantic similarity.

42:23 I know that's like a lot of words, so I'm going to break that idea down, right?

42:26 So like canine and dog, let's take these two words as an example, right?

42:30 These two words are completely different.

42:32 They don't even share a single character and similarity together, right?

42:36 They don't have a single letter in common with one another.

42:39 And yet we know as humans that these two dogs, these two words mean the same thing.

42:44 They refer to a dog.

42:46 So like they have different syntax, but the same semantics, the same semantic meaning.

42:51 So their vectors would be...

42:52 Would be mapped close.

42:53 Would be very close by whatever metric you're using, yeah.

42:56 If you extend this idea and like you imagine, okay, what if you split a document?

43:01 What if you split a file into like one to two sentence chunks?

43:05 And then for each of these like sentences, let's just say sentences, for example.

43:08 Let's say we split a document to sentences and then we take each of those sentences and then map,

43:14 use an embedding model to compute their embedding and then store them inside of a vector store.

43:19 Like basically like a local database that has, that just stores all of these vectors in like a file or something, right?

43:26 Now imagine what happens if we then take a prompt, like a question that we might have,

43:31 encode that as an embedding.

43:33 And then we say to the vector store, okay, for this prompt embedding, find me all of the other embeddings are close to this.

43:41 Well, what you've just done in this process is called a semantic search.

43:46 So it's kind of like syntax search, except instead of searching based off of key words or tokens or other syntactic traits,

43:53 you are searching based off the actual natural language meaning of the word.

43:59 This is much more applicable when it comes to like natural language prompts and natural language,

44:04 like corpuses of data, because this is like the actual information that's being stored.

44:09 Not, we don't care about the characters.

44:11 We care about the information that they represent, right?

44:14 The essence of it, yeah.

44:15 And these vectors are computed by the larger language model?

44:18 The vectors, the embeddings are computed by an embedding model.

44:22 And they're actually a separate category of model that we have our own special APIs for.

44:27 So in our settings panel, you can change the language model.

44:31 And I think we already discussed that, right?

44:33 Yeah.

44:33 But what's interesting is that underneath that, you'll also see a section that says embedding model.

44:38 And you can change the embedding model to like, we also offer that same principle of model agnosticism there.

44:45 Yeah.

44:46 This is very interesting.

44:47 Very interesting.

44:48 Let's talk a little bit about the format.

44:49 You said, obviously, that you can do LaTeX, which you say in, you say math, right?

44:54 You tell us, give me math.

44:56 Yeah.

44:56 Give me math.

44:56 Which is, yeah, it's pretty interesting.

44:58 But you can do images, markdown, code, HTML, JSON, text.

45:02 There's a lot of, a lot of different formats you can get the answer back in.

45:05 When you use the AI magics, we can pass them to like a renderer first before we show the output to the user.

45:13 Yeah.

45:13 And with the AI magic, the %AI in the cell, you can also specify, that's where you put the format potentially,

45:20 but you can also specify the model and the service, I guess, for the provider.

45:25 The IPython magics are basically stateless in the sense that you always have to specify the model explicitly.

45:32 They don't operate off the premise that you are using JupyterLab.

45:36 They don't run off the premise that you have JupyterLab installed or are using the lab extension that we offer.

45:42 Because of that, like the model is stated explicitly every time.

45:46 That's by design.

45:47 When you sit down, what is your JupyterAI provider set to?

45:53 What's your favorite?

45:54 As a developer, I like literally pick a random one to give the most test coverage at all times.

46:01 And that's actually a great way of finding bugs.

46:03 So, yeah, I don't have a favorite one.

46:05 My favorite one is the one that works.

46:07 And hopefully that should be all of them.

46:09 So.

46:09 I guess you can also tell it to forget what I was talking about.

46:14 We're going to start over.

46:15 That's pretty interesting that you can do that along the way because you maybe had a bunch of conversations.

46:20 And we talked about the benefit of it, like knowing the history of that conversation.

46:24 But you're like, all right, new idea.

46:25 Switching topics.

46:27 Chat.

46:27 I think the last one I wanted to talk about specifically was interpolating prompts.

46:32 Kind of almost like f-strings where you can put in the prompt text, you can put a variable and then other parts of your notebook or program can set that value.

46:43 Yeah.

46:43 Tell us about this.

46:44 You can define a variable in your IPython kernel, right?

46:48 So, like, and that's just dumb, but just like how you define any other variable.

46:52 But what's interesting is that IPython is actually aware of the variables that you are defining.

46:57 So, we can programmatically access that when we implement the magic, right?

47:02 Basically, if you define any variable at the top level scope, like let's say poet equals Walt Whitman, right?

47:09 So, we have a name variable called poet.

47:10 And then you can send a prompt over, like, write a poem in the style of curly braces poet and curly braces.

47:18 And when that is run, the value of that variable is interpolated and substitutes around the curly braces.

47:27 So, the final prompt becomes write a poem in the style of Walt Whitman.

47:31 And when that prompt is sent, well, you can imagine it generates a poem of Walt Whitman.

47:35 The variable interpolation that we offer in IPython is very useful for, like, very quick, like, debugging.

47:42 So, you can actually reference a cell in the notebook directly.

47:47 I think a lot of people don't know this, but, like, in a Jupyter notebook, there's, like, the in and out indicators to the left of each cell.

47:54 So, it'll say, like, in dot one, out dot one, right?

47:57 So, those are actual variables.

47:59 And you can use them here, too.

48:01 So, you can reference, like, debug.

48:03 Tell me why this code is failing.

48:05 Curly braces in dot one.

48:07 That's, you just, on the screen there, just scroll there.

48:11 Explain what this does, the in bracket 11, or what went wrong in out bracket 11, or, yeah, something like this.

48:19 Right.

48:19 It's fine long as you don't go and rerun that cell.

48:22 Yeah.

48:22 But, like you said, this is not for long term.

48:25 You can make it independent of the order of execution just by, like, assigning whatever variable that is to, whatever, like, content that is to a named variable.

48:35 That way, no matter what order you run them, like.

48:38 People might be thinking, when you describe this interpolation thing, just bracket, boom, bracket, curly brace, curly brace, you're like, we already have that in Python.

48:46 You just put an F in front of the string, and so on.

48:48 But this is in the message that goes to the AI cell magic, not straight Python, right?

48:55 That's the relevance.

48:55 That's why this is interesting, right?

48:57 Yes.

48:57 I guess it really comes down to the different models that you select.

49:00 So, you opt into this a little bit, or maybe you need to understand it that way.

49:05 But talk to us a bit about privacy.

49:07 If I select something and say, what does this do?

49:10 What happens?

49:11 Something important to emphasize here is that whenever you use a language model that's hosted by a third party, so, like, it's running on their servers, right?

49:21 Regardless of whether this model is free or not.

49:24 Like, the fact that you're sending data to a third party over the internet, like, that's where the privacy and security concerns happen, right?

49:33 And that happens, like, whenever, like, you're sending data across the wire over the internet.

49:37 But we have some special safeguards in place here, specifically to assuage fears of, like, concerns over privacy and security that a lot of open source users have.

49:49 And one of the important ideas here is that Jupyter AI is both transparent and traceable.

49:54 So, when we send a prompt to a language model, that's always captured in the server logs by default.

50:02 So, that's always being logged.

50:03 That's always being captured.

50:04 Yeah.

50:05 So, like, it's always going to be traceable.

50:07 There's no secret back channel.

50:09 Like, you tell them, people, this is happening, okay?

50:11 Yeah.

50:12 So, if, like, an operator needs to audit, like, oh, dang, like, let me check just to make sure nothing scary was sent over to OpenAI.

50:20 Well, the operator can review the server logs and make sure that all usage is compliant with whatever privacy policy their company has, right?

50:30 And Jupyter AI is also exclusively user-driven, meaning that we will never, by default, send data to a language model, even if you selected one, right?

50:43 Like, we will never send data to that language model until explicit action is done by the user.

50:48 So, in this case, like, clicking the send button, clicking shift, enter, and running to sell.

50:53 Yeah.

50:53 Nothing is sent to language model or embedding model until that happens.

50:57 That's really all you can do.

50:58 That sounds great.

50:59 Because you don't control what happens once it hits OpenAI or Anthropic or whatever, right?

51:04 That's why the transparency is so important, right?

51:06 And, oh, I forgot to touch on traceability.

51:09 So, like, with these AI-generated cells, right, so, like, the output cells, in the metadata, we indicate that with, like, the model that was used to generate an output cell, if it comes from the Jupyter AI magic.

51:21 So, that way, it's also, like, traceable not just in the logs, but, like, in the actual files metadata and stuff as well.

51:28 That's cool, because it'd be really easy to say, have the cell magic, and then, you know, how notebooks store their last output.

51:35 Like, if you upload them to GitHub, they'll keep their last output, unless you say clear all cell outputs.

51:40 Depending on which model you have selected, you might not get the same output, not even close to the same output.

51:46 So, you might want to know, like, well, how did you get that picture?

51:49 Oh, I had Anthropic selected and not the other, right?

51:52 Like, that is really nice.

51:53 I've actually used the server logs myself to debug, like, prompt templates, for instance, right?

51:58 So, like, because what we show in the logs is the full prompt.

52:02 Like, after applying our template, after applying, like, the edits, like, that's what's actually shown.

52:07 So, that's really also really helpful for developers who need to debug what's happening.

52:12 Yeah, of course.

52:12 Or if you're a scientist and you're looking for reproducibility, I doubt there's much guaranteed reproducibility even across the versions of the same model.

52:20 But at least you were in the same ballpark, you know, at least I know what model it came from.

52:25 You can set the temperature to zero, but it won't generate very fun output.

52:29 That is a workaround if you truly need a reproducibility.

52:33 I suppose, yeah.

52:34 The temperature being the ability to tell it how creative do you want to be or how focused do you want the answer to be, right?

52:41 It's a hyperparameter that basically governs the randomness, like, how far away from the mean it's willing to deviate.

52:49 Kind of, yeah, vaguely describable as creativity, yeah.

52:52 Yeah, I suppose.

52:53 I guess if you're looking for privacy, the GPT for all might be a good option.

52:58 Oh, yeah, absolutely.

52:59 For that, right?

52:59 Because that's not going anywhere.

53:01 Yeah.

53:01 However, some of them do have license restrictions.

53:04 And that's also why we have also taken it this low when it comes to adding more GPT for all support is because different models are licensed differently.

53:14 And that's another consideration we have to take in mind.

53:17 Yeah.

53:17 Yeah, of course.

53:18 You're playing in a crazy space with many different companies evolving licenses.

53:22 Yeah.

53:23 Let's close it out with one more thing here.

53:25 Maybe two more things.

53:26 One is there's a lot of interest in these agents.

53:30 And it sounds, for example, your create a notebook that does this sort of thing, like teach me about Matplotlib, is a little bit agent driven.

53:38 Thinking of like LangChain and stuff like that, or even GPT engineer.

53:43 What's the story?

53:45 Do you have any integrations with that?

53:46 Any of those types of things?

53:47 We're working on one where basically you won't need to use slash commands anymore.

53:52 So this is like, again, like we're just kind of playing around with this, seeing how well it behaves.

53:58 But we are trying to use agents to, for example, remove the need to use slash commands.

54:02 So when you say, like generate a notebook, like it will just generate one.

54:07 You don't have to note a slash command for that.

54:09 Like it will just know.

54:10 So like, yeah.

54:11 However, we don't have any, we don't use agents at the present moment now.

54:15 And the reason for that is that they have very, they're not very model agnostic, at least the ones from our research.

54:23 Like they only work well for a specific model and a specific prompt template.

54:27 But beyond that, it's hard.

54:29 All right.

54:29 Last question.

54:30 Running out of time here.

54:31 Yeah.

54:32 What's your favorite thing you've done with Jupyter AI?

54:33 Not like a feature, but what have you made it do to help you that you really like?

54:38 Teach Jupyter AI about Jupyter AI.

54:40 That's an easy one.

54:42 So we have the documentation.

54:44 What did it learn?

54:44 Well, learn about itself.

54:46 So like I was at a conference and I, so this was at PyData and I actually got enough questions to the point where I did, I just downloaded, I had the documentation already available in my home directory because of some other previous.

55:00 Or restructured text them.

55:01 Yeah.

55:02 And the markdown source.

55:03 Okay.

55:03 The markdown source.

55:04 Right.

55:05 And then I just had slash learn.

55:07 I just learned that doc.

55:08 And then I just had that laptop on the side and told people, like, if you have any questions, try answering that.

55:14 Yeah.

55:15 In case you don't want to wait in line.

55:17 So yeah, that was, yeah, it's pretty remarkable.