Grokking Algorithms in Python

Episode #82, published Thu, Oct 27, 2016, recorded Mon, Oct 24, 2016

Algorithms underpin almost everything we do in programming and in problem solving in general. Yet, many of us have partial or incomplete knowledge of the most important and common ones. In this episode, you'll meet Adit Bhargava, the author of the light and playful Grokking Algorithms: An illustrated guide book.

If you struggled to understand and learn the key algorithms, this episode is for you.

Links from the show:

Adit on the web: adit.io
Book: Grokking Algorithms: An illustrated guide (use code talkpython45 for 45% off):
manning.com/books/grokking-algorithms
Book: Grokking Algorithms: An illustrated guide (Amazon):
amzn.to/2dQngeA
Grokking Algorithms GitHub: github.com/egonSchiele/grokking_algorithms
Adit on Twitter: @_egonschiele
High perf search of Talk Python: talkpython.fm/search

Episode Deep Dive

Guest Introduction and Background

Adit Bhargava is the author of Grokking Algorithms: An Illustrated Guide and a seasoned software engineer who has taught Python courses and holds a master's degree in computer science. His unique background in both programming and graphic design led him to craft an algorithms book featuring playful illustrations and beginner-friendly examples. He also blogs about technical topics on adit.io and has open-sourced all the images from his book on his GitHub (github.com/EgonSchiele). Adit joined this episode to discuss the importance of algorithms, their practical applications, and how visual explanations can demystify complex computer science topics.

What to Know If You're New to Python

If you're coming to this topic with minimal Python experience, here are a few considerations to help you follow the concepts in this episode:

Basic Data Structures (lists, dictionaries): Be comfortable with Python's core containers (lists and dicts) as a foundation.
Simple Control Flow: Understanding conditionals and loops (if-statements, for-loops) will help you get the most out of the algorithmic concepts.
Recursion Overview: Know that functions can call themselves to break down problems. This is a key idea behind many algorithms Adit discusses.
Pythonic Approach: Familiarity with Python's readability and straightforward syntax will let you focus on the core ideas rather than the language details.

Key Points and Takeaways

Power of Illustrated Algorithms: Adit emphasizes how visual, playful examples lower the barrier to learning complex topics in computer science. By illustrating algorithms with boxes, buses, or everyday stories, the concepts become intuitive rather than abstract. This approach helps students and experienced developers better understand and retain how algorithms work.
- Links and Tools:
  - Grokking Algorithms book (Manning Publications)
  - Adit's Blog: adit.io
  - GitHub (EgonSchiele)
Big O Notation and Performance: Big O notation measures how an algorithm's running time scales with input size. It doesn't tell you the exact speed, but rather how performance grows (e.g., linear, logarithmic, quadratic). Understanding Big O lets you compare algorithms (e.g., O(n) for linear search vs. O(log n) for binary search) and make informed decisions about efficiency.
- Links and Tools:
  - Wikipedia on Big O notation
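
To make the growth rates above concrete, here is a minimal sketch that counts steps instead of measuring time. The three function names are made up for this illustration and do not come from the book or the episode.

```python
def logarithmic_steps(n):
    """Count how many times n can be halved before reaching 1: O(log n)."""
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps


def linear_steps(n):
    """Count the steps of a single pass over n items: O(n)."""
    steps = 0
    for _ in range(n):
        steps += 1
    return steps


def quadratic_steps(n):
    """Count the steps of pairing every item with every item: O(n^2)."""
    steps = 0
    for _ in range(n):
        for _ in range(n):
            steps += 1
    return steps


# Doubling n adds one logarithmic step, doubles the linear steps,
# and quadruples the quadratic steps.
for n in (8, 16, 32):
    print(n, logarithmic_steps(n), linear_steps(n), quadratic_steps(n))
```

The point is the shape of the growth rather than the exact counts, which is what Big O describes.
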
Binary Search vs. Linear Search: Binary search requires the data to be sorted but delivers logarithmic time complexity by continually halving the input. Linear search is simpler and works on unsorted data but is O(n) in the worst case. The classic "guess a number" game (between 1 and 100) illustrates the speed of halving the search space: 100 possibilities need at most 7 guesses, and even 4 billion possibilities can be narrowed down with only 32 guesses.
- Links and Tools:
  - Python docs on list methods
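
The halving idea can be sketched as follows. This is an illustrative implementation, not the code from the book; the `guesses` counter is an addition here so the number of steps is visible, and a `range` object stands in for a sorted list so that 4 billion values never have to be stored in memory.

```python
def binary_search(sorted_items, target):
    """Return (index, guesses); index is None if target is absent."""
    low, high = 0, len(sorted_items) - 1
    guesses = 0
    while low <= high:
        mid = (low + high) // 2
        guesses += 1
        if sorted_items[mid] == target:
            return mid, guesses
        if sorted_items[mid] < target:
            low = mid + 1   # target is in the upper half
        else:
            high = mid - 1  # target is in the lower half
    return None, guesses


def linear_search(items, target):
    """Check items one by one; works on unsorted data but is O(n)."""
    for index, item in enumerate(items):
        if item == target:
            return index
    return None


# range objects support len() and indexing, so they behave like sorted lists.
_, small = binary_search(range(1, 101), 100)
_, large = binary_search(range(1, 4_000_000_001), 4_000_000_000)
print(small, large)  # never more than 7 and 32 guesses respectively
```
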
Arrays vs. Linked Lists: Arrays store elements in a contiguous block of memory for quick random access (O(1) to jump to index i), but inserting new elements can be costly. Linked lists allow cheap insertion or removal because only pointers need updating, yet random access is O(n) since you must traverse from the start. Depending on whether you need more efficient lookups or frequent insertions, you choose an array or linked list.
- Links and Tools:
  - Visual guide to arrays vs. linked lists (blog post by Adit)
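
A rough sketch of the trade-off, assuming a hand-rolled singly linked list (the `Node` and `LinkedList` classes are hypothetical, written only for this example) next to Python's built-in `list`, which is array-backed:

```python
class Node:
    def __init__(self, value, next_node=None):
        self.value = value
        self.next = next_node


class LinkedList:
    def __init__(self):
        self.head = None

    def push_front(self, value):
        """Insert at the head: O(1), only one pointer changes."""
        self.head = Node(value, self.head)

    def get(self, index):
        """Random access: O(n), walks node by node from the head."""
        if index < 0:
            raise IndexError("index out of range")
        node = self.head
        for _ in range(index):
            if node is None:
                break
            node = node.next
        if node is None:
            raise IndexError("index out of range")
        return node.value


linked = LinkedList()
for value in ("c", "b", "a"):
    linked.push_front(value)   # list is now a -> b -> c

array = ["a", "b", "c"]
array.insert(0, "z")           # O(n): every existing element shifts right
print(linked.get(2), array[2])  # O(n) walk vs. O(1) index jump
```
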
Recursion and Divide and Conquer: Recursion shines when a problem naturally decomposes into smaller subproblems, such as quick sort and tree traversals. A base case handles the simplest scenario, and the recursive step tackles the bigger challenge by reducing it to smaller instances. Divide and conquer further harnesses recursion by systematically breaking down data, like in the quick sort example, into subarrays around a pivot.
- Links and Tools:
  - Quick sort algorithm (Wikipedia)
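
The base case and recursive step described above can be sketched with a short quick sort. This version favors clarity over speed: it builds new lists rather than sorting in place and always picks the first element as the pivot, which is a simplification chosen for this example.

```python
def quicksort(items):
    # Base case: lists with zero or one element are already sorted.
    if len(items) < 2:
        return list(items)
    # Recursive step: split around a pivot, then sort each part.
    pivot = items[0]
    smaller = [x for x in items[1:] if x <= pivot]
    larger = [x for x in items[1:] if x > pivot]
    return quicksort(smaller) + [pivot] + quicksort(larger)


print(quicksort([10, 5, 2, 3]))
```
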
Hash Tables (Python Dictionaries): Hash tables (dictionaries in Python) provide average case O(1) lookups, making them extremely versatile for storing and retrieving data by key. Adit's "cashier Maggie" example illustrates the quick lookup advantage: you just ask "How much is an avocado?" and instantly get a price. In real-world Python, dictionaries power everything from fast membership checks to building lightning-fast data pipelines.
- Links and Tools:
  - Python docs on dict
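
In Python the price lookup might look like the sketch below. The items and prices are invented for illustration, and `price_of` is a hypothetical helper.

```python
# A dict maps each key straight to its value, with no scanning of a list.
prices = {"avocado": 1.49, "milk": 0.99, "apple": 0.67}


def price_of(item):
    """Return the price, or None if the item is unknown (average O(1))."""
    return prices.get(item)


print(price_of("avocado"))
print("milk" in prices)  # membership checks are also average O(1)
```
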
Graph Traversal and Shortest Paths: Many real-world problems are graph-based (e.g., bus routes, social networks). Representing these as nodes and edges lets you apply standard algorithms like breadth-first search (BFS) to find the shortest path. Whether it's a ride across town or a link to a friend-of-a-friend, BFS can answer "How many steps away is X?" or "How can I get from A to B with the fewest transfers?"
- Links and Tools:
  - NetworkX (Python library for graph analysis)
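
A minimal breadth-first search over a graph stored as a dict of adjacency lists could look like this. The route map is made up for the example (only the two landmark names echo the episode), and `fewest_steps` is a hypothetical helper name.

```python
from collections import deque


def fewest_steps(graph, start, goal):
    """Return the number of edges on a shortest path, or None if unreachable."""
    if start == goal:
        return 0
    visited = {start}
    queue = deque([(start, 0)])
    while queue:
        node, distance = queue.popleft()
        for neighbor in graph.get(node, []):
            if neighbor == goal:
                return distance + 1
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append((neighbor, distance + 1))
    return None


# Hypothetical one-way bus routes.
routes = {
    "twin peaks": ["stop a", "stop b"],
    "stop a": ["stop c"],
    "stop b": ["golden gate bridge"],
    "stop c": ["golden gate bridge"],
    "golden gate bridge": [],
}
print(fewest_steps(routes, "twin peaks", "golden gate bridge"))
```

Because the queue is processed in order of distance from the start, the first time the goal is reached is along a path with the fewest edges.
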
Greedy Algorithms: A "greedy" solution picks the local optimum at each step. Though it doesn't always yield the perfect global result, it often provides a good enough or near-optimal solution. Examples include scheduling tasks by earliest finishing time, where the greedy choice happens to be optimal, and the knapsack problem, where greedily selecting the best immediate item is fast but only approximate.
- Links and Tools:
  - Knapsack problem (Wikipedia)
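
Scheduling by earliest finishing time can be sketched in a few lines. The task names and times below are invented for illustration, and `schedule` is a hypothetical helper.

```python
def schedule(tasks):
    """Pick non-overlapping tasks greedily by earliest finishing time.

    tasks is a list of (name, start, end) tuples.
    """
    chosen = []
    last_end = None
    for name, start, end in sorted(tasks, key=lambda task: task[2]):
        # Greedy choice: take the next task to finish if it does not overlap.
        if last_end is None or start >= last_end:
            chosen.append(name)
            last_end = end
    return chosen


# Hypothetical classes competing for one room (times in hours).
classes = [
    ("art", 9.0, 10.0),
    ("english", 9.5, 10.5),
    ("math", 10.0, 11.0),
    ("cs", 10.5, 11.5),
    ("music", 11.0, 12.0),
]
print(schedule(classes))
```

For this particular problem the greedy rule is known to give the largest possible set of non-overlapping tasks, which is not true of greedy strategies in general.
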
Dynamic Programming: Dynamic programming uses memoization (storing subresults) and a table-based approach to solve problems by building up from smaller solutions. This approach is commonly used for more complex tasks like certain scheduling, pathfinding, or optimization problems. Adit's chapter on dynamic programming in Grokking Algorithms took months of work, but it simplifies tough topics by using 2D table examples.
- Links and Tools:
  - Intro to Dynamic Programming (Khan Academy)
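
As one example of the 2D-table approach, here is a sketch of the 0/1 knapsack problem. The items, weights, and values are hypothetical, and the function assumes integer weights.

```python
def knapsack(items, capacity):
    """Return the best total value that fits within capacity.

    items is a list of (name, weight, value) with integer weights.
    best[i][w] holds the best value using the first i items and capacity w.
    """
    best = [[0] * (capacity + 1) for _ in range(len(items) + 1)]
    for i, (_, weight, value) in enumerate(items, start=1):
        for w in range(capacity + 1):
            # Option 1: skip this item and reuse the row above.
            best[i][w] = best[i - 1][w]
            # Option 2: take it, plus the best use of the remaining space.
            if weight <= w:
                best[i][w] = max(best[i][w], best[i - 1][w - weight] + value)
    return best[len(items)][capacity]


# Hypothetical items: (name, weight, value).
goods = [("guitar", 1, 1500), ("stereo", 4, 3000), ("laptop", 3, 2000)]
print(knapsack(goods, 4))
```

Each cell is computed from cells already filled in, which is the "building up from smaller solutions" idea. Simply grabbing the single most valuable item here (the stereo) would give a worse total than the table finds.
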
K-Nearest Neighbors for Recommendations: The simplest idea behind recommendation engines like Netflix is to find your "nearest neighbors", people who rate content similarly, and suggest items that those neighbors love but you haven't seen. This is known as the K-nearest neighbors (KNN) approach in machine learning. While not perfect for all scenarios, it's effective for quick, straightforward recommendation systems.
- Links and Tools:
  - scikit-learn KNN docs
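
A toy version of the nearest-neighbors idea, in plain Python rather than scikit-learn. The users, titles, and ratings are invented, and the choices of Euclidean distance, `k=2`, and a minimum rating of 4 are assumptions made for this sketch.

```python
import math


def distance(a, b):
    """Euclidean distance over the titles both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return math.inf
    return math.sqrt(sum((a[title] - b[title]) ** 2 for title in shared))


def recommend(ratings, user, k=2, min_rating=4):
    """Suggest unseen titles that the k most similar users rated highly."""
    others = [name for name in ratings if name != user]
    neighbors = sorted(
        others, key=lambda name: distance(ratings[user], ratings[name])
    )[:k]
    seen = set(ratings[user])
    picks = set()
    for name in neighbors:
        for title, score in ratings[name].items():
            if title not in seen and score >= min_rating:
                picks.add(title)
    return sorted(picks)


# Hypothetical ratings on a 1 to 5 scale.
ratings = {
    "alice": {"comedy a": 5, "action b": 1, "drama c": 4},
    "bob": {"comedy a": 5, "action b": 2, "drama c": 4, "romance d": 5},
    "carol": {"comedy a": 1, "action b": 5, "drama c": 2, "horror e": 5},
    "dave": {"comedy a": 4, "action b": 1, "drama c": 5, "scifi f": 3},
}
print(recommend(ratings, "alice"))
```
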

Interesting Quotes and Stories

"When I want to solve a problem, I ask myself: Can I solve it with a hash table or a graph? 90% of the time, one of those two approaches gets me there." -- Adit

"If you struggle with the core idea of an algorithm, try a playful example, like rummaging around in boxes, or bus stops connecting from Twin Peaks to the Golden Gate Bridge." -- Adit

Key Definitions and Terms

Big O Notation: A way to describe the worst-case or average-case growth rate of an algorithm's runtime or space usage.
Recursion: When a function calls itself to break a problem down into simpler parts.
Greedy Algorithm: Picks the best immediate choice at every step, aiming for a reasonable (not always perfect) solution.
Dynamic Programming: Solves problems by combining solutions to smaller subproblems, storing those intermediate results to avoid recomputations.

Learning Resources

Python for Absolute Beginners: If you're just starting your Python journey, this course provides the fundamentals of the language in a structured way.
Write Pythonic Code Like a Seasoned Developer: Grow your fluency in Python and learn to write code that leverages Python's best features and idioms.
Adit Bhargava's GitHub with images from Grokking Algorithms
Official Python Docs on Data Structures

Overall Takeaway

Algorithms are the cornerstone of solving problems efficiently in programming. By combining creative visual examples with clear, concise explanations, Adit shows that challenging topics like Big O notation, divide and conquer, and dynamic programming can be accessible. As you explore these algorithms, remember that knowing the right tool, whether it's recursion, hash tables, or graph search, can transform how quickly and elegantly you solve day-to-day coding problems. Above all, don't underestimate the power of concrete examples and illustrations to lock in these concepts for the long haul.

Episode Transcript

00:00 Algorithms underpin almost everything we do in programming and in problem solving in general.

00:04 Yet many of us have partial or incomplete knowledge of the most important and common ones.

00:09 In this episode, you'll meet Adit Bhargava, the author of the light and playful Grokking Algorithms: An Illustrated Guide book.

00:16 If you struggle to understand and learn the key algorithms, this episode is for you.

00:21 This is Talk Python To Me, episode 82, recorded October 24th, 2016.

00:27 Welcome to Talk Python To Me, a weekly podcast on Python.

00:57 The language, the libraries, the ecosystem, and the personalities.

01:00 This is your host, Michael Kennedy. Follow me on Twitter where I'm @mkennedy.

01:04 Keep up with the show and listen to past episodes at talkpython.fm.

01:08 And follow the show on Twitter via @talkpython.

01:10 This episode is brought to you by Capital One and Intel.

01:15 Thank them both for sponsoring the show by checking out what they're offering during their segments.

01:19 Adit, welcome to Talk Python.

01:22 Thank you.

01:23 I'm really enjoying your book that I started reading a little while ago, Grokking Algorithms

01:28 with Python.

01:29 And I think we're going to have a great time talking about it.

01:31 But before we do, tell us your story.

01:33 How did you get into programming in Python?

01:34 Well, I think I have a slightly atypical route.

01:38 Because I got into programming with this thing called Thing Maker.

01:43 I don't know if anyone remembers this.

01:46 It was, you know, back in the day, WWE had put it out, like the World Wrestling Entertainment

01:53 guys.

01:54 Okay.

01:55 Yeah.

01:55 And it was just a simple way to, like, make little games or, you know, make a website.

02:00 And I started playing around with that.

02:03 And then I thought, you know, I would love to make my own games.

02:06 And then I started playing with Flash.

02:08 So now you know how old the story is.

02:11 Yep.

02:12 Flash was probably new and exciting at the time.

02:14 It was.

02:16 And, like, it wasn't even owned by Adobe.

02:18 It was owned by Macromedia.

02:20 And Macromedia Director was a big thing.

02:24 Anyway, so I, like, started, you know, started with ActionScript.

02:27 And I sold a game.

02:29 And that's when I thought, you know, I could do programming.

02:32 This is really fun.

02:33 Oh, yeah.

02:34 That's cool.

02:34 So that's a ways away from Python.

02:37 How did you get over to the Python world?

02:39 Eventually, I was, like, because I had learned C, C++, kind of the standard route that a lot

02:45 of people take.

02:45 And I just wanted to get back to those Flash days when, you know, I mean, to me, I would

02:54 rather work with a simple language that can execute my ideas.

02:58 And that's when I found Python.

03:01 And it was so easy to use.

03:03 You know, it's like you can think about something and type it out as you're thinking about it.

03:09 Yeah, that's cool.

03:09 It's like programming is fun again, right?

03:11 Yeah.

03:12 Exactly.

03:13 It reminds me of that XKCD Python.

03:15 All right.

03:17 Well, that's excellent.

03:18 Did you go to a university and do a computer science degree?

03:21 I actually did my undergrad in graphic design.

03:25 Okay.

03:26 Because, you know, I mean, I love drawing.

03:29 It's been a hobby for me for a long time.

03:31 And then I worked as a designer for a while.

03:35 But I realized that I would enjoy programming more as a job.

03:40 So then I went to UChicago and got a master's in computer science.

03:45 Okay, cool.

03:46 Yeah, I guess that makes a lot of sense.

03:47 Like the illustrated part of your book, which we're going to get to, which is very cool.

03:52 You know, there's a lot of things you can learn.

03:54 You can just self-teach yourself about programming.

03:58 You know, there's a lot of good boot camps.

04:00 There's a lot of good online classes.

04:01 Things like that.

04:02 But I do feel one of the shortcomings of that type of education really is around the proper use of data structures and algorithms.

04:12 And not necessarily being able to write them, but knowing sort of the trade-offs, right?

04:17 Yeah, exactly.

04:18 You decided to solve that problem.

04:19 And you took your graphic arts skills and applied them, not just your programming skills, but your graphic arts skills,

04:26 and applied them to this problem with this book you wrote called Grokking Algorithms,

04:31 an illustrated guide for programmers and other curious people.

04:34 Yeah.

04:36 So I read your book, a good portion of it.

04:39 I find it delightful.

04:40 I think it is really, really nice.

04:43 You know, there's a lot of these concepts that if I remember reading algorithm-type books,

04:48 and it's just, you know, you're about to fall asleep.

04:51 It's just so dense and so hard to make it real, right?

04:55 It's like, I don't know, it's just tough to grok them, I guess, right?

04:59 And your book is filled with playful examples and fun pictures and things like that, right?

05:05 What was inspiration for that?

05:07 This whole thing started because I wrote a blog post on Haskell, and it's this blog post about monads, which is a tough concept for a lot of people to understand.

05:18 The post got really popular, and still, you know, to this day, I see it being tweeted.

05:25 That's when, you know, I was like, you know, using pictures can really make a big difference

05:30 when you're trying to explain hard concepts.

05:32 So that's where I started with the book.

05:36 It was like, I would love to do, because I've always known I would love to do a book on algorithms.

05:41 Like you said, there's all the ones that I've read so far are so hard to read,

05:46 and I had a very good algorithms professor at Chicago, and she made things so easy.

05:53 So, you know, that's when I thought it would be great to have a book that makes it as easy as she made it for me.

06:00 And it just, you know, it just seemed like a good fit.

06:03 You started your blog at adit.io, and I'll be sure to link to that, right?

06:09 And you started doing these visual sort of explanation posts.

06:12 Yeah.

06:13 Did like, so this book is published by Manning, is that right?

06:16 Yes.

06:16 Yeah, so did Manning reach out to you and go, hey, this illustrated idea is really cool,

06:21 or did you approach them and say, I'd like to do something more torturous to myself?

06:26 No.

06:27 They reached out to me.

06:29 It was just the craziest.

06:30 After that, I kind of joked, like, this is how I get success.

06:34 I just sit back and wait for people to come to me.

06:38 There's a lesson there, though, I think, and that's really cool to hear.

06:41 I think there's a really important lesson there that you don't know what opportunities are out there or how they're going to come to you.

06:49 But if you don't ever put yourself out there and you don't try to do things in public, they're absolutely not going to come, right?

06:56 Like, so many opportunities have come to me because of the podcast, but I never even imagined that they would.

07:01 I wasn't looking for them, and I didn't do the podcast for them or anything like that.

07:05 But just, like you said, being out there makes a big difference.

07:08 I agree.

07:09 And I put in so much work into those blog posts.

07:12 Like, it used to take me weeks to write a single post.

07:16 And it's kind of like just throwing a note in a bottle, right?

07:20 Like, you don't know if anyone's even reading it or even cares.

07:25 But somehow, you know, I mean, I enjoyed it.

07:28 That's why I kept doing it, and things just worked out.

07:31 So your book on algorithms is written in Python, and you could have chosen many languages, Java, JavaScript, and so on.

07:38 Why Python?

07:39 It's just the easiest language for me.

07:41 It's the easiest to learn.

07:43 I used to teach a course on Python, and it was just people picked it up so quickly.

07:48 Yeah, I totally agree with that.

07:50 You know, I was thinking earlier as you were talking about you want to get back to the ActionScript days and things that are easy.

07:57 A lot of people know JavaScript, but I feel like JavaScript has, by way of its popularity, become hard again.

08:03 You know, like all the Node.js dependencies, all the layers piled on top of it.

08:10 Somehow, Python has managed to avoid that fate, which keeps it simple but powerful.

08:14 I think it's great.

08:14 I'm constantly confused by JavaScript.

08:17 I think it's a...

08:18 I love JavaScript because it has a lot of those functional parts that I really like.

08:24 But yeah, the JavaScript community is moving so fast, it's hard to keep up.

08:28 Yeah, for sure.

08:30 So one thing that you said when you were talking in the preface of your book, you said that for everything that you describe, you like to lead with examples.

08:39 Yes.

08:41 Well, so this is part of my...

08:44 I mean, I have this, you know, some ideas about like what it means to teach something well.

08:49 And I think one big piece of that is you have to make things concrete.

08:55 I'm reading this book on probability right now.

08:58 And one thing that really frustrates me about this book is they'll start with a simple problem.

09:05 So they'll say, you know, for example, one of the problems in the book is this classic random walk idea.

09:11 So you have a man standing at the edge of a cliff.

09:15 And there's a one-third chance that he'll step towards the cliff.

09:19 And there's a two-third chance that he'll step away from the cliff.

09:23 And he's drunk, so he doesn't know which direction he's going to step in.

09:27 So, of course, if he takes a step towards the cliff, he's just going to fall off.

09:30 But there's a two-third chance he'll step away.

09:32 And, you know, again, there's a one-third chance that he'll step towards, two-thirds that he'll step away.

09:36 So the question is, does he escape the cliff or does he eventually fall off?

09:42 So a pretty good probability problem, you know, because it's simple to state and pretty hard to solve.

09:50 But then I read their answer in the back and they're like, okay, let's generalize this problem.

09:55 And let's say X is the probability of him stepping towards and Y is the probability of stepping away.

10:02 And I was like, no, you have concrete numbers.

10:05 Isn't it so much better to be able to visualize the problem as you're stepping through it?

10:10 And the only way to do that is by having a concrete example.

10:13 Yeah.

10:13 Yeah, I agree.

10:14 I feel like a lot of things, the concreteness of it and sort of the reason the whole thing came to be interesting in the first place gets, it kind of gets sterilized and all that stuff gets lost.

10:27 And you're down to just the essence of the abstract problem.

10:30 But then the joy of the problem is kind of gone in some ways, right?

10:33 So I really like that you have examples.

10:35 And these are like really playful examples.

10:38 I think it's important to start with making things concrete and then you can get into the theory and the abstract and so on.

10:45 Yeah.

10:46 I think I also try to choose pretty real world examples so people can, you know, look at the problem and think about all the different ways they could use it in their work.

10:57 Yeah.

10:57 I really like the one about Maggie and the checkout counter.

11:00 We'll get to that one.

11:01 Thank you.

11:03 So I have a pretty diverse listenership and people, a lot of people have, you know, PhDs in computer science, but a lot of people have come from different areas.

11:12 So just real quickly, what, tell us, what's an algorithm for those of us who don't know?

11:16 An algorithm is a set of instructions for accomplishing a task.

11:20 So really any piece of code could be an algorithm, right?

11:24 But there are certain pieces that are more interesting, maybe because they solve a hard problem really well, or maybe they can be used at a lot of different places.

11:35 So that's really what we talk about when we say algorithms, you know.

11:40 Yeah.

11:40 Okay.

11:40 I totally agree.

11:41 I think it's the reusability is definitely an interesting part.

11:44 Like the way you might log something to disk might be an algorithm, but it's not reusable in any way because it's exactly specific to a thing.

11:51 So you wouldn't call it an algorithm per se, right?

11:54 Yeah.

11:54 Cool.

11:55 All right.

11:56 So in your book, you broke it down to, like you said, there's going to be two major things that you learned.

12:00 One, you said you're going to learn about performance and the other, you're going to learn about solving problems.

12:05 How's that work?

12:06 The solving problems one is the really big one that I focused on.

12:11 Back when I was starting out, I just used to run into walls sometimes where you like come across a problem and you just don't know how you're going to solve it because you've never solved a problem like that before.

12:23 Or you don't really have a lot of tools in your tool belt.

12:28 So you can't say like, oh, maybe I'll use a hash table for this and see if that works.

12:33 Or maybe I should try to use some kind of a machine learning approach.

12:38 If you, you know, there's this idea of you don't know what you don't know.

12:44 And if you don't know that those tools exist, you're just stuck.

12:47 You can't solve those problems.

12:48 So this book is really about giving you all those tools like that should be able to solve most of the problems you encounter day to day.

12:58 Yeah, that's really cool.

12:59 And some of the examples you give are like, if you're making a video game, you could learn how to create an AI system that can find its way through paths, or a recommendation system kind of like Amazon.

13:10 Things that you might like, or even knowing that there are problems that are basically not solvable in a quick way.

13:18 Yeah, those NP complete problems.

13:21 Yeah, yeah.

13:21 That's a whole different interesting area.

13:23 I think performance is also interesting.

13:26 I mean, you talked about hash tables just a minute ago and dictionaries.

13:31 Like, knowing that if I have this type of thing I need to do, it's 100 times faster using this data structure than that data structure and being able to express that.

13:42 It helps you design how you solve a problem.

13:45 If you're like, you know, if I can coerce it into this shape, it will be rocket fast.

13:50 Absolutely.

13:51 That's something, to me, that is really about the craft of engineering.

13:56 One reason I love Python is you can write beautiful code with it.

14:02 And I think thinking about performance is just another aspect of that craft where you like, you know, you want to write something that runs well, that's dependable.

14:13 And that means it should run as fast as it can run.

14:16 Yeah.

14:16 Yeah, up to the point where it becomes, like, you can optimize something so much that it becomes write only.

14:25 Like, you wrote it and it works, but you can't understand it and nobody else can certainly understand it, right?

14:29 Oh, yeah.

14:29 So there has to be this tradeoff of maintainability.

14:33 But, yeah, knowing the right data structures and algorithms and how that works in terms of performance is key.

14:41 So the algorithms, your algorithm book is not, like, comprehensive.

14:45 It's not like an encyclopedia of all known algorithms or anything like that, right?

14:51 It's more a special list.

14:55 You said the topics that you cover are all things you've used at work, which is pretty cool.

15:00 Exactly.

15:01 I think if you're looking for an encyclopedia, like, you could just read the Wikipedia articles or, like, one of those algorithms books that are, like, a thousand pages long.

15:10 You know, I didn't want to, I didn't want my book to be, like, we printed out Wikipedia and bound it into a book format.

15:18 Yeah.

15:18 Thank you for not doing that.

15:21 So we're going to talk in depth about some of the algorithms and stuff.

15:25 But can you just give us, like, a quick flyover of what's covered and things like that?

15:30 Sure.

15:30 So the first part of the book is, like, foundational work that I'll be using in the rest of the book.

15:37 So chapter one introduces you to a very basic algorithm and then talks about big O notation, which is something a lot of people get very confused by.

15:47 But it's an important concept.

15:49 And then chapters, chapter two is about memory.

15:53 Chapter three is about recursion.

15:55 So you can see I'm kind of building the foundation, like, the very basics.

15:59 And then we get into the interesting stuff.

16:02 So, like, chapter four, this is where I introduce one of those tools that I was talking about called divide and conquer, where, you know, you get a problem.

16:12 I don't know how to solve this.

16:14 Well, you could think, like, maybe I'll use divide and conquer and see if that works.

16:18 Chapter five is hash tables, which is the most useful tool in my tool belt.

16:25 I totally agree with you.

16:26 Yeah, you had a good quote.

16:27 You said, when I want to solve a problem, there are two plans of attack I start with.

16:31 Like, can I make this a hash table problem?

16:34 Can I solve it with a graph, maybe?

16:36 One of those two, right?

16:37 Exactly.

16:37 I feel like, I mean, this is kind of a secret tip.

16:42 But when I get into an interview and someone gives me a problem, I immediately just think hash tables or graphs.

16:50 And it works, like, 90% of the time, you know.

16:54 Nice.

16:55 So, yeah, obviously, I think graphs are really important.

16:57 So chapter six and seven are graphs, another very useful tool.

17:02 Chapter eight is another good tool, and it's so easy.

17:06 Like, this might be the easiest chapter in the book, because chapter eight is really, like, do the dumbest thing you can think of, and it'll probably work.

17:17 And that's what a greedy algorithm is.

17:20 Yeah.

17:21 Sometimes, you know, computers are fast.

17:22 Sometimes the problems aren't that big.

17:24 Just solve it simply, right?

17:26 Exactly.

17:27 Chapter nine is probably the one I'm most proud of and the one that people have complimented me the most on,

17:36 because it's dynamic programming, and it is this really hard way to solve things.

17:42 That's not the same as programming with a dynamic language versus static languages.

17:47 It's something different, right?

17:48 Yeah.

17:49 It's all about creating this 2D array and splitting a problem up into sub-problems and then using those solutions for those sub-problems for the bigger problem.

17:59 It kind of, I feel like it just blows people's minds.

18:03 It's just so hard to use.

18:04 I mean, I spent maybe three or four months on this chapter alone, and I think it's really well-written, and a lot of people have found it very useful.

18:15 So that's chapter nine.

18:16 And then chapter 10, I just wanted to put a little thing about machine learning in here, because k-nearest neighbors is so easy to use, and it's so effective.

18:29 And it was just, you know, you could read this chapter before bed, and you would know how to build a recommendations engine.

18:35 Oh, that's great.

18:35 That's really cool.

18:36 Yeah.

18:37 And then you finish out with saying, basically, now that you're inspired, you know the foundational algorithms.

18:43 Here's a whole bunch more things you can go do, right?

18:45 Yeah.

18:46 And then there's 10 things in chapter 11 that I think are really neat, and they're just, oh, I wish I could have put all 10 of these in the book also, but I just gave a little tidbit about each one.

19:01 Sure.

19:02 That'll be the follow-on, the second version.

19:06 Yeah.

19:06 So one thing that I liked about your book is you have little short exercises, like a couple minutes, two or three minutes at the end of each section, each chapter, that you can do to sort of test your thinking.

19:19 And I found those to be really nice.

19:20 Thank you.

19:21 I've been reading this book called A Book of Abstract Algebra, and it's got a great model where all the chapters are like two pages followed by six pages of exercises.

19:32 I found that those exercises helped me so much, so I figured, you know, I need to put more exercises in my book.

19:38 So who did you have in mind?

19:40 Like, if I have no programming or very little programming experience, would this book be useful?

19:45 If I had a computer science degree, would it still be useful?

19:47 So it's been really popular with boot camp students so far.

19:52 I have a few friends who went to boot camps, and, you know, I have a friend who's in a boot camp right now, and he kind of told other students in the camp,

20:02 about it, and they all seem to like it.

20:06 I think it's also useful for people who have a computer science degree.

20:13 So I work at Etsy, and I kind of, you know, just sent an email out when the book came out saying, like, hey, I've published a book.

20:19 You know, isn't that cool?

20:22 And I've had people email me, like, really senior people, you know, people who are, like, two levels above me at Etsy.

20:30 And they're like, wow, I, finally I have great examples to explain these concepts with.

20:37 Because it's so hard to come up with a good example to explain something.

20:41 Yeah, absolutely.

20:41 It's one thing to know it.

20:42 It's another to teach it to another person.

20:45 And I think it is really, I think it's really accessible in that way.

20:49 Thank you.

20:49 It also doesn't require a lot of math, which I like.

20:52 You know, you don't need calculus or linear algebra or anything.

20:57 It's really, like, if you know the basics of algebra, like, if you know what a function is, you can read this book.

21:05 Yeah.

21:06 I like that you said that.

21:07 You know, I was talking to a friend of mine about sort of what does it take to become a programmer?

21:12 Like, how much math do you need to know?

21:15 And I think outside of the programming field, like, people who are not programmers

21:20 feel like programming is very, very mathematical, right?

21:24 Like, if you don't know how to do calculus and differential equations, you'll never be a good programmer.

21:29 And to be honest, I find I do very little actual advanced math in anything that I do,

21:34 except for when I was working for, like, a scientific visualization company.

21:38 But outside of that context, like, it's more just knowing, like, critical thinking and problem solving.

21:43 And so I think that people feel like they need to learn a lot of math.

21:47 I mean, math is not a bad thing, but I don't think it's required.

21:50 Do you?

21:51 I don't think it's required either.

21:53 I think, especially for something like this, where, like, algorithms are the first step to, like, a long journey in computer science.

22:02 And that first step should be as easy as possible.

22:06 There shouldn't be, like, big hurdles to get to that first step.

22:10 Yeah, totally agree.

22:11 Totally agree.

22:12 Okay, cool.

22:13 So let's talk about some of the algorithms and some of the things that you covered in the book.

22:18 The very first one that you covered was sort of different types of searching in sets and really leading towards binary search.

22:28 Yeah.

22:28 This is my favorite algorithms example because anyone can understand it.

22:36 Like, you don't even need to know anything about computers.

22:42 But because a lot of people, you know, when they heard I'm writing a book, they were like, tell me what is an algorithm.

22:48 So I gave them this example where I was like, you know, think of a, I'm thinking of a number between 1 and 100.

22:56 Take a guess.

22:56 And they would say, you know, 55.

22:58 And I'd say, well, that's too high.

23:02 Take another guess.

23:03 And they'd say 30.

23:04 And that's interesting because people automatically divide the space in half when they guess, almost on instinct.

23:13 Like, maybe they don't even understand what they're doing, but they're running binary search in their mind.

23:20 You know, that's all binary search is.

23:21 Divide the space in half every time to get to the results or to find the result as quickly as you can.
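
The guessing game above can be sketched as a binary search over a sorted Python list. This is a minimal illustration, not code from the book; the function name `binary_search` and the step counter are assumptions added here to make the halving visible.

```python
def binary_search(items, target):
    """Return (index, steps) for target in sorted items, or (None, steps)."""
    low, high = 0, len(items) - 1
    steps = 0
    while low <= high:
        steps += 1
        mid = (low + high) // 2      # guess the middle of what is left
        guess = items[mid]
        if guess == target:
            return mid, steps
        if guess > target:           # too high: discard the upper half
            high = mid - 1
        else:                        # too low: discard the lower half
            low = mid + 1
    return None, steps


numbers = list(range(1, 101))        # the numbers 1 through 100
index, steps = binary_search(numbers, 30)
print(index, steps)
```

The search only works because `numbers` is sorted; on an unsorted list, "too high" would say nothing about which half to discard.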

23:27 Yeah, it's such a simple example.

23:28 And yet I think it really dramatically points out the power of taking that, formalizing it just a little bit and making it really fast.

23:38 Capital One has a special message for you.

23:55 They need Python pros who love to work with data.

23:57 Put your Python experience to work at Capital One and help them use data to make life better for millions of customers.

24:03 Capital One is employing the latest tools and approaches to do data analytics and data science

24:08 from the ground up.

24:09 They're smart, creative professionals who love to explore new ways to interact with data.

24:13 They're interested in figuring out novel, advanced Python techniques, and even more interested in finding more people who will help them do that.

24:20 When you join their state-of-the-art Python community, you'll work with people you really like,

24:24 people who might be listening to this podcast right now.

24:26 Relentless innovation is their way of life.

24:28 Make it yours at Capital One.

24:30 Visit jobs.capitalone.com slash talkpython to learn more and apply today.

24:37 You had a cool example.

24:41 You said, you know, imagine that you play this.

24:44 You have a couple of ways which you could do it.

24:47 Like you could just say like, I've got a list of 100 things and you want to search for something in there.

24:52 Right?

24:53 Kind of like the example you just gave.

24:55 One option is to look at the first one, check the next one.

25:00 Just go through it in order, right?

25:02 Another way is to apply this binary search thing, assuming that set is somehow ordered like an address book or something.

25:08 And you said, well, for 100 items, if you go straight through it, you know, worst case, you'll have to go,

25:13 you'll search until you get to the end.

25:14 That's 100.

25:15 But if you use the binary search because you halve it and halve it and halve it, worst case scenario, you get seven.

25:20 But then you said, well, let's think about it, four billion.

25:23 Yeah.

25:24 And it's so crazy because now the difference, you know, now it's really obvious.

25:29 Like, searching through four billion items is too much.

25:33 But with binary search, it's just 32.

25:36 Maximum of 32 guesses, right?

25:39 Even if I'm guessing a number between zero and four billion.

25:42 Yeah.

25:43 Isn't that such a huge difference?

25:45 Yeah.

25:46 It's insane.

25:47 I mean, it makes sense.

25:50 But at the same time, you don't think such a simple idea is going to reduce it from four

25:55 billion comparisons to 32 at worst case.
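
The 100-versus-7 and 4-billion-versus-32 figures can be checked with a few lines of arithmetic. This sketch assumes the worst case for binary search on n sorted items is floor(log2(n)) + 1 guesses, which for a positive integer is exactly `n.bit_length()`; the helper names are illustrative.

```python
def worst_case_linear(n):
    # Checking items one at a time may require looking at all of them.
    return n


def worst_case_binary(n):
    # Each guess halves the remaining candidates.
    # For n >= 1, n.bit_length() equals floor(log2(n)) + 1.
    return n.bit_length()


for n in (100, 4_000_000_000):
    print(n, worst_case_linear(n), worst_case_binary(n))
```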

25:58 Yeah.

25:58 It's really amazing.

25:59 Yeah.

26:00 Yeah.

26:00 So one of the things I think is hard for people who have not gone through a formal computer science

26:06 background and just for everybody listening, I don't have a computer science degree, but

26:09 I have a minor in computer science.

26:11 So I've gone through some of it.

26:12 So I guess I'm somewhere in the middle of this.

26:13 But if you are self-taught, if you've gone to a boot camp or if you're just really new,

26:19 people often talk about big O notation around performance of algorithms.

26:24 And I think that that's kind of mysterious to people.

26:27 And it also seems to be something that ends up on job interviews often.

26:32 And so if you don't have that experience, you're like, I don't even know what big O is.

26:36 Like, well, sorry, you're out or something, right?

26:40 Like it can be a bigger problem than it maybe really deserves to be.

26:45 But I think it's worth knowing big O notation for a couple reasons.

26:48 Yeah.

26:49 Dude, this was a really good way for you to introduce that concept, I thought.

26:52 Like, pretty crazy, right?

26:53 Yeah, exactly.

26:55 And I won't, you know, get into the full explanation here.

26:59 But I do want to say, I think every beginner engineer I meet has trouble with big O notation.

27:06 So, you know, if some of your listeners are still new to engineering, just to say, like, that's not just you.

27:14 Don't feel that.

27:15 That's right.

27:15 Yeah.

27:16 But that's why I, you know, it's right up front in the book.

27:20 And then I talk about it again in chapter four.

27:24 So, I spend a lot of time trying to explain big O notation in this book.

27:29 Yeah, absolutely.

27:31 So, I think it's pretty interesting.

27:33 You've got the linear search, which is what they call O of N.

27:38 So, if you have N items, you have to do N comparisons.

27:41 If you have two N items, you have to do two N comparisons.

27:44 It grows basically linearly.

27:45 But this binary search one is log of N, which doesn't sound like that big of a difference.

27:52 Until you realize it's 4 billion versus 32.

27:54 You know?

27:55 Which is pretty amazing.

27:57 So, knowing this relative scale, that doesn't actually tell you how fast it is, does it?

28:03 That just tells you, like, relatively how much slower does it get as you get more data.

28:08 Exactly.

28:09 Because everyone's computer, you know, calculates at a different speed.

28:12 So, you can't put a time on it.

28:15 Yeah.

28:15 And that becomes interesting later when you find algorithms that actually look worse in big O, but often they're not.

28:24 And so, you talked about this idea of average time versus worst case time.

28:29 I think that's also important to understand.

28:32 Yeah.

28:33 That's another really interesting one where, you know, if you can say that your algorithm is going to take a short time on average and a really long time, worst case, maybe that's fine.

28:48 And, you know, if you're just a website or, you know, you have a basic consumer app and you're like, well, it'll run fast most of the time.

28:56 So, that's fine.

28:57 If you're NASA and you have to guarantee a certain time, then you really care about that worst case time also.

29:05 Yeah.

29:05 Any real-time system.

29:07 So, if you were doing like a flight control system on a spaceship or if you're doing trading in some sort of high-speed trading system or I worked on a system that would actually analyze eye tracking data in real time.

29:22 And it would get 250 samples per second.

29:25 And if it couldn't process using some very advanced sort of wavelet decomposition algorithms and whatnot.

29:32 If you couldn't process that in, you know, four milliseconds.

29:35 Well, then it just couldn't keep up because that was how fast data was coming, right?

29:39 I mean, there's these situations where worst case time maybe is super important.

29:44 But a lot of times, like you said, average time I think is fine.

29:47 So, give us some, like for some algorithms that we might know, give us the big O performance stats.

29:54 Sure.

29:55 So, binary search, we already talked about log N and searching, you know, linear search, looking at one item at a time is big O of N.

30:06 And like we just talked about, that's a big difference, right?

30:10 Log N versus N.

30:11 So, again, if you think about that, a slow sorting algorithm is N squared.

30:17 And my example is selection sort.

30:20 A fast sorting algorithm is going to be N log N.

30:23 So, again, you have that log N versus N difference.

30:27 So, the fast sorting algorithm is much faster.
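
As a rough illustration of the "slow" N squared case, here is one way selection sort might be written in Python. It is a sketch for comparison only, not the book's code: it scans the remaining items for the smallest one (about N work) and repeats that N times. Python's built-in `sorted` is an N log N sort and is what you would normally use.

```python
def selection_sort(items):
    """Return a new sorted list by repeatedly pulling out the smallest item."""
    remaining = list(items)          # copy so the caller's list is untouched
    result = []
    while remaining:
        smallest = min(remaining)    # one full scan of what is left: O(n)
        remaining.remove(smallest)   # removes the first matching item
        result.append(smallest)      # the scan repeats n times: O(n^2) overall
    return result


print(selection_sort([5, 3, 6, 2, 10]))
```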

30:29 Yeah.

30:30 And after that, there's like N cubed algorithms.

30:34 My extreme example is you can get big O of N factorial algorithms.

30:42 And, you know, if people don't know what a factorial is, that's like 5 factorial would be 1 times 2 times 3 times 4 times 5.

30:51 6 factorial would be 1 times 2 times 3 times 4 times 5 times 6.

30:55 Factorial grows really fast.

30:57 If you have an N factorial algorithm, that just means you just can't use it most of the time.

31:04 It only works on extremely small data sets, right?

31:07 Yeah.

31:07 So, the example, probably the canonical example for that is the traveling salesperson problem.

31:12 Yep.

31:13 And this is one of those NP complete problems we have talked about where, you know, the problem is really simple.

31:20 You're a traveling salesman and you have a list of cities that you want to travel to.

31:27 And you want to figure out the shortest route that hits all of those cities.

31:31 So, it just seems so simple.

31:34 Like, you know, why couldn't you calculate that?

31:37 And, of course, you can.

31:38 It means that you have to come up with every permutation of cities, of the order of cities.

31:45 Which, if you have 6 cities, it's 6 factorial permutations.

31:50 If you have 100 cities, it's 100 factorial permutations.

31:56 And just to give you an example of how crazy that is, 6 factorial is 720, which, you know, your computer can do.

32:05 100 factorial is 9 followed by 157 zeros.
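
The brute-force approach described here can be sketched with `itertools.permutations`. The four cities and their distances below are made up for illustration, and the route is treated as a one-way path through every city rather than a round trip, which is an assumption.

```python
import math
from itertools import permutations

# Hypothetical symmetric distances between four made-up cities.
DISTANCES = {
    ("A", "B"): 4, ("A", "C"): 2, ("A", "D"): 7,
    ("B", "C"): 3, ("B", "D"): 1, ("C", "D"): 5,
}


def distance(a, b):
    # Distances are stored once per pair, so check both orderings.
    return DISTANCES[(a, b)] if (a, b) in DISTANCES else DISTANCES[(b, a)]


def route_length(route):
    return sum(distance(a, b) for a, b in zip(route, route[1:]))


def shortest_route(cities):
    # Try every ordering of the cities: n! candidates for n cities.
    return min(permutations(cities), key=route_length)


best = shortest_route(["A", "B", "C", "D"])
print(best, route_length(best))
print(math.factorial(6))                    # number of orderings for 6 cities
print(len(str(math.factorial(100))))        # how many digits 100! has
```

Four cities means only 24 orderings, so this finishes instantly. The same loop over 100 cities would have to walk a number of orderings that is 158 digits long.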

32:12 Wow.

32:14 Yeah.

32:15 That means you're going to run out of time or you're going to run out of memory.

32:19 One of those first.

32:20 But you're probably going to run out of something, right?

32:21 Yeah.

32:22 I think the universe will end before you can make that.

32:26 Absolutely.

32:28 All right.

32:29 So, the next thing that you talked about that I thought was cool was selection sort.

32:33 And you took a moment to say, like, let's think about the two data structures that hold stuff in just like a list style.

32:44 And that was linked lists and arrays.

32:46 And I thought that was a really interesting tradeoff, knowing how you're going to use them and so on.

32:52 So, tell us about that comparison you made.

32:54 Sure.

32:55 So, the example I use in the book is this idea of, you know, you're going to watch a movie.

33:01 And let's say you're going there.

33:04 There's eight of you.

33:05 Eight people are going.

33:06 And you're trying to find seats.

33:08 So, maybe you can find eight seats all together and then you can all sit together.

33:14 Maybe there's no set of eight seats together.

33:17 So, you have to kind of sit all over the theater.

33:21 And you know where each other, you know where your group is, but they're not all in one place.

33:27 And that's this idea of linked lists versus arrays.

33:30 Where arrays, you're all sitting together.

33:33 Linked lists, you're all sitting separately.

33:37 You know, your data is together in memory or apart.

33:40 So, with arrays, you basically have a contiguous block.

33:43 Exactly.

33:44 And with linked lists, each element knows how to find the next element.

33:50 Sometimes you have doubly linked lists.

33:51 So, you can start at the back and go forward or forwards and go backwards.

33:53 But each element is more or less in charge of going to a new memory location to find the next or be in the end.

33:59 Exactly.

34:00 And if I could stretch this movie analogy a little bit.

34:04 Let's say, you know, you have a bag of popcorn.

34:07 And you're kind of passing it down the row.

34:10 Really easy to do if you're all sitting together, right?

34:13 Because you can just, you know, the next person is just to your right.

34:17 And you just keep passing the bag to your right.

34:19 And that's what arrays are.

34:23 So, it's really easy to access the next element in an array.

34:27 And it's easy to say, like, you know, I want to find the fifth item in my array.

34:35 Because everyone's together, you can just do the math.

34:38 Like, zero plus five equals five.

34:40 Linked list, it's like, now you're passing this bag of popcorn around.

34:45 You have to go to the next person in the movie theater.

34:47 And then they have to go to the next person.

34:49 It's a little more arduous.

34:51 And if you want to find the fifth person, you can't just go directly to that person.

34:56 You have to go to the first one.

34:58 And the first one has to go to the second one.

34:59 Second one has to go to the third one.

35:01 So, you have to, like, follow these links down.

35:04 Yeah, that makes perfect sense.

35:06 So, I mostly, if I think of the data structures I use, I mostly use arrays.

35:11 So, lists.

35:12 And dictionaries.

35:14 And then sometimes I want distinct stuff.

35:16 So, sets.

35:16 I don't find myself using linked lists so often.

35:19 But they do have some interesting trade-offs.

35:22 Like, when is an array good versus when is a linked list good?

35:25 So, again, going back to this movie example.

35:28 Sometimes you go to the theater and you just don't have eight seats together, right?

35:34 Like, sometimes you just can't fit an array in memory because you don't have, you know,

35:41 if your array, if the array you want to create is too big, you just don't have space for it.

35:46 Or, what can be bad also is, let's say, eight of you sit down at the theater and you found

35:55 eight seats, everything's great, and now another person shows up.

35:59 So, now you have nine people.

36:00 But there's no space for a ninth person.

36:04 So, now you have to all get up and go around trying to find that ninth seat.

36:10 So, you know, similarly, when you want to add elements to an array, let's say you allocated

36:16 memory for 100 elements.

36:17 And now you want to increase the size of your array to 200.

36:21 Well, it's going to be a lot of work to move those, all those items to a different part of

36:27 memory.

36:27 You know, that's bad performance-wise.

36:31 Right.

36:31 If you have a linked list.

36:32 Or if I want to insert one in the middle, something like that, right?

36:35 Exactly.

36:35 Like, it's hard to move all those items.

36:38 But for the linked list, you can just put them somewhere and just change the links around.

36:42 Yeah.

36:43 So, you talked about the big O performance of both of those.

36:47 And basically, inserts for linked lists are O(1).

36:51 So, constant, like super fast.

36:54 But, like, random access is order N, which is not so great.

37:00 But it's almost the reverse for arrays, right?

37:03 Random access is just instant, more or less.

37:06 But adding something grows as you have more items, right?

37:10 Because you've got to copy and reallocate and all that.

37:12 So, they're almost like counterpart, like opposites in some way from a performance tradeoff.
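
To make the trade-off concrete, here is a small sketch of a singly linked list next to a regular Python list (which is array-backed in CPython). The `Node` class and helper names are hypothetical, written only for this illustration: reaching the fifth element means following links one at a time, while inserting after a node you already hold is a constant-time pointer change.

```python
class Node:
    def __init__(self, value, next_node=None):
        self.value = value
        self.next = next_node        # each element knows where the next one is


def build_linked_list(values):
    head = None
    for value in reversed(values):
        head = Node(value, head)
    return head


def nth(head, index):
    """Follow `index` links from the head: O(n) random access."""
    node = head
    for _ in range(index):
        if node is None:
            break
        node = node.next
    if node is None:
        raise IndexError("linked list index out of range")
    return node.value


def insert_after(node, value):
    # O(1): nothing is moved, only one link is rewired.
    node.next = Node(value, node.next)


seats = ["Ann", "Bo", "Cy", "Di", "Ed"]
head = build_linked_list(seats)
print(seats[4])       # array: jump straight to the fifth seat
print(nth(head, 4))   # linked list: walk four links to get there
```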

37:17 Exactly.

37:18 And that's kind of, you know, you hear all that and you start thinking, gosh, I wish I had something that was as good as arrays for reads and as good as linked lists for inserts.

37:31 And that's when you start getting into, like, the more complex data structures.

37:36 Right.

37:36 Absolutely.

37:37 So, another thing that you covered that I recall, like, this has burnt a spot into my brain from when I learned it is recursion.

37:47 And I just remember recursion, like, blowing my mind when I first thought of problem solving with recursion.

37:53 Oh, my gosh.

37:54 Yeah.

37:54 This is another one that's so hard for people to start thinking about it because it's just, you know, a function calling itself.

38:05 That just seems crazy.

38:07 But that's why I have a lot of examples about recursion and I have a lot of exercises.

38:13 And I kind of try to break it down so people understand the structure of a recursive solution.

38:21 Even if you never plan to use recursion in a problem, there are plenty of algorithms that other people have created that use recursion.

38:30 So, if you want to understand those algorithms, you need to know what recursion is.

38:35 Yeah, absolutely.

38:36 And there are times when you can solve a problem without recursion.

38:39 But the data you're trying to understand is so perfectly lined up for recursion that the solution is just dead simple.

38:47 If you realize that that's something in your toolbox, right?

38:50 Like tree, like depth first sort of tree type processing and things like that.

38:54 Exactly.

38:54 Yeah.

38:55 So, one thing that you had at the beginning was this example with boxes.

38:59 And you have like a little box story in the attic for loops and recursions.

39:05 Yeah.

39:06 Yeah.

39:06 So, this was a toy example where, you know, you're going to your grandma's attic and you're looking for this key and she has so many boxes.

39:17 And it could be among these boxes.

39:19 So, you open a box and then you see more boxes inside that box.

39:25 So, now you can think about, you know, there's two ways you could find this key.

39:30 You could kind of keep this list of boxes, right?

39:34 So, like you open a box, you see more boxes and you just add them to your pile of boxes to check.

39:40 And you, the algorithm you're running is you pick up a box from the pile, look for the key.

39:50 If you see some boxes, you add it to the pile.

39:54 And until you find the key, you just grab another box from the pile and check it for the key.

39:59 And that's the while loop approach, right?

40:02 Because while you don't have the key, go to the pile, pick up a box, search for the key.

40:09 And the recursive approach would be open a box.

40:13 If there's a key, you're done.

40:16 If there's a box, open the box.

40:19 If there's a key, you're done.

40:21 If there's a box, open the box, you know.
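
Both approaches to the attic search can be sketched in a few lines. The model here is an assumption made for illustration: a box is a Python list, a nested list is a box inside a box, and the key is the string `"key"`.

```python
def find_key_loop(box):
    """While-loop approach: keep a pile of boxes still to check."""
    pile = [box]
    while pile:
        current = pile.pop()             # grab a box from the pile
        for item in current:
            if item == "key":
                return True
            if isinstance(item, list):   # found another box: add it to the pile
                pile.append(item)
    return False


def find_key_recursive(box):
    """Recursive approach: if there's a key, done; if there's a box, open it."""
    for item in box:
        if item == "key":
            return True
        if isinstance(item, list) and find_key_recursive(item):
            return True
    return False


attic = [["photos"], [["buttons"], ["key"]], []]
print(find_key_loop(attic), find_key_recursive(attic))
```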

40:23 Yeah.

40:24 It's got this beautiful, very simple quality to me where you can express it in two lines.

40:30 Like, if key, done.

40:32 Else, keep going.

40:34 Do it again.

40:36 Do it again.

40:36 Just open the box and see if there's a key in it.

40:39 Yeah, it's got this very natural way of solving the problem, doesn't it?

40:42 That's cool.

40:42 Yeah.

40:43 We all love Python for its tremendous productivity benefits.

41:01 But getting the best performance takes some work.

41:03 What if you could get out of the box, easy access to high-performance Python?

41:08 Intel distribution for Python developers delivers just that.

41:11 Get close to 100 times better performance for certain functions when using NumPy, SciPy,

41:16 scikit-learn, linked with optimized native libraries like Intel Math Kernel Library,

41:21 access efficient multi-threading, and Python projects like Numba and Cython.

41:25 Try the Intel distribution for Python and experience performance today at talkpython.fm/Intel.

41:31 And profile your Python and native C, C++ applications for performance hotspots with Intel VTune amplifier.

41:39 With Intel, it's all about performance.

41:49 I like how you've got some nice pictures and the pictures are even simpler.

41:53 It just feels really great.

41:56 The other thing that was interesting was in your first example, you talked about having this list of boxes.

42:04 And you put the stuff in the list and you take the stuff out of the list.

42:07 In the recursion example, you don't have anything that is the storage of where you are or what box you're working on or anything like that.

42:16 Like, how do you keep track of the boxes?

42:18 So that's a really interesting part of recursion, where you're kind of making the computer do the work for you, right?

42:27 Because you call, let's say you call the look for key function.

42:32 So you've called it once, and the computer has that, you know, that information noted.

42:39 Like, okay, he's called look for key once.

42:42 And then you call it again inside look for key.

42:45 So the computer says, okay, that's the second call to the look for key function.

42:51 And then you call it again, and it says, okay, that's the third call.

42:55 So it's kind of keeping track of those calls for you.

42:58 And those function calls are your array, basically.

43:03 You're keeping track of all the boxes you have to check through that array of function calls.

43:10 But the computer is doing it all for you behind the scenes.

43:13 You don't even have to think about it.
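
The bookkeeping described here is usually called the call stack. The sketch below makes it visible by recording how deep each call is; the `depth` and `trace` parameters are additions for illustration only, and boxes are modeled as nested lists, which is an assumption.

```python
def look_for_key(box, depth=1, trace=None):
    """Return (found, trace), where trace lists the depth of every call made."""
    if trace is None:
        trace = []
    trace.append(depth)                  # one entry per function call
    for item in box:
        if item == "key":
            return True, trace
        if isinstance(item, list):
            # The computer remembers where this call left off while
            # the nested call runs; no explicit pile of boxes is needed.
            found, _ = look_for_key(item, depth + 1, trace)
            if found:
                return True, trace
    return False, trace


attic = [["photos"], [["buttons"], ["key"]], []]
found, trace = look_for_key(attic)
print(found, trace)
```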

43:14 Yeah, absolutely.

43:15 It's just, it's the way programs execute, right?

43:18 And you're just taking advantage of that.

43:20 It lines that up for you.

43:21 That's cool.

43:22 So this whole recursion thing is more or less, is very good at solving this kind of divide and conquer inductive problem.

43:30 If you can talk about some kind of base case, and you can talk about, well, how do I take like one step away from there?

43:38 You can probably apply recursion.

43:39 Yep.

43:40 I'm reading this book called How to Solve It, which is a famous math book about like solving hard problems.

43:47 And I love one of the parts in this book.

43:50 He says, if you come across a problem you can't solve, change it into a problem that you can solve and solve that problem instead.

43:59 I mean, that's so easy.

44:00 And that's the same thing with divide and conquer, where it's really hard to solve this problem.

44:05 But I'm going to just take it down to the smallest component that I can solve and use the solution for that to solve this bigger problem.

44:14 Yeah.

44:15 It kind of gives you a foothold on climbing the solution or whatever.

44:19 Yeah.

44:20 So you have two examples of divide and conquer that you gave in this area.

44:24 Yeah.

44:25 And I'm going to talk about the quick sort one because it's so elegant.

44:31 Yeah.

44:31 It's a great example of divide and conquer.

44:33 And again, such a simple idea.

44:36 You have an array of elements that you want to sort, but you don't know how to sort an array.

44:41 Well, what's an array you can sort?

44:43 How about if the array had zero elements?

44:46 That's pretty easy to sort.

44:47 It's just there's nothing to sort.

44:50 It's sorted.

44:52 Similarly, if you have an array with one element, pretty easy.

44:54 If you have an array with two elements, it's still, you know, you just check which one's bigger and put it at the end.

45:00 So all of these are the easy examples.

45:03 Now you get to an array with three elements.

45:06 And quick sort says, just pick an element from the array.

45:10 So it doesn't matter which one.

45:13 So I'll just pick the first one.

45:14 So let's say your array is 5, 3, 7.

45:18 So I pick 5 as my, it's called the pivot element.

45:22 And now I look at the rest of the elements in the array.

45:28 So 3 and 7.

45:29 And I know that 3 is less than 5.

45:32 And I know that 7 is greater than 5.

45:35 So now I have these two sub-arrays, right, of elements less than the pivot and elements greater than the pivot.

45:42 And now I just call quick sort again on those two arrays separately.

45:49 So I call quick sort of this array that only has the element 3.

45:53 And we know how to solve, we know how to sort an array with one element.

45:58 It's just 3.

45:59 And then you have the pivot because that's the number greater, you know, we know that that's greater than 3.

46:06 And then you call quick sort on the second array, which has the 7.

46:10 And again, it's just one element, we know how to sort that.

46:13 So you end up with these three sub-arrays, one with just a 3, one with just a 5, and one with just a 7.

46:20 And you just smash them all together and you have a sorted array.

46:25 So using just the knowledge of how to sort an array with 0, 1, or 2 elements, you sorted an array with 3 elements.

46:33 And now that you can do that, you can sort an array with 4 or 5 elements.

46:36 And you can kind of sort any array you want just by solving that small problem.
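
The walkthrough above maps almost line for line onto a short recursive function. This is a sketch of that idea, assuming the first element is always chosen as the pivot; it builds new lists rather than sorting in place, which keeps it readable at the cost of extra memory.

```python
def quicksort(array):
    # Base case: arrays with 0 or 1 elements are already sorted.
    if len(array) < 2:
        return list(array)
    pivot = array[0]                                    # pick the first element
    less = [x for x in array[1:] if x <= pivot]         # elements up to the pivot
    greater = [x for x in array[1:] if x > pivot]       # elements above the pivot
    # Sort each sub-array the same way, then join the pieces together.
    return quicksort(less) + [pivot] + quicksort(greater)


print(quicksort([5, 3, 7]))
```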

46:41 Yeah, you just continue to break it down, even if you have a million, right?

46:44 Exactly.

46:44 Nice.

46:45 Yeah, quick sort is lovely.

46:47 And the history of quick sort is pretty interesting.

46:50 So another thing, the next thing in your book that you talked about is one of my favorite data structures.

46:57 I don't necessarily use it the most, but when I do use it, it's so awesome.

47:00 And that's hash tables or dictionaries, right?

47:02 Yeah.

47:03 I mean, this is one of the reasons I love JavaScript is, you know, JavaScript objects are just hash tables.

47:11 So hash tables are such a big part of JavaScript.

47:15 And it's, I mean, like I said, I feel like almost any problem, I could just, you know,

47:20 if I just want the quick and dirty solution, I can just do the hash table and call it done.

47:24 Yeah.

47:25 So you had a really nice example of a checkout person.

47:28 Oh, yeah.

47:29 The Maggie.

47:30 The Maggie.

47:31 Yes.

47:31 You need a Maggie.

47:32 How much is an avocado?

47:33 It's $1.49.

47:33 Thank you, Maggie.

47:34 Yeah.

47:35 Isn't, I mean, that's exactly what a hash table is, where you can either look up prices

47:40 in this book and it kind of takes you some time.

47:43 Or you just have a person there who has it all memorized.

47:46 And I just say, you know, it's 67 cents.

47:50 Thank you.
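
In Python, Maggie is a dictionary. A minimal sketch: the avocado price comes from the conversation, while the item name attached to the 67-cent price and the `price_of` helper are made up for illustration.

```python
# Item name -> price in dollars. "apple" is a hypothetical item.
prices = {
    "avocado": 1.49,
    "apple": 0.67,
}


def price_of(item):
    # A dictionary lookup takes roughly the same time however many items exist.
    return prices.get(item)      # None when the item is unknown


print(price_of("avocado"))
print(price_of("durian"))
```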

47:50 Maggie is my wife's name.

47:53 And I feel like she is so much smarter than I am.

47:56 So I knew I needed to make her a character in this book.

47:59 Oh, that's a nice touch.

48:01 Yeah.

48:03 Yeah.

48:03 Very cool.

48:03 Yeah.

48:04 So if people want to get a sense of like how powerful hash tables or dictionaries are,

48:09 I just last week wrote a search engine so people could search every single bit of content of

48:16 all the podcast episodes.

48:17 So if you go to talkpython.fm, there's like a little search thing in the top right.

48:22 And you can click it and you can type in complex searches and it'll find basically anything that

48:27 matches all those keywords.

48:28 And the way it works is it goes through all the transcripts.

48:34 It goes through all of the show notes.

48:36 It goes through all the titles, all those various things and a few others.

48:40 And it turns it into a bunch of keywords and turns that into a dictionary.

48:44 And then for each keyword it finds, it figures out if there's a piece that matches, you know,

48:51 what pieces match this keyword and it puts that in there.

48:53 And if you go there, you know, it's like 80 hours of conversation plus some other stuff.

48:58 And you can type in a keyword and hit enter and it runs in sub millisecond time, 100% in Python.

49:03 You know, so you, yeah, you can search for like five, finding the things that contain these five words across 80 hours of conversation, 0.1 milliseconds.

49:13 100% Python.

49:15 Beautiful.

49:15 I mean, that is like, think of if that was trying to, you know, regular expression, the text,

49:20 or it was trying to like, you know, literally search it or whatever, right?

49:24 Like it would be insane.

49:25 You just, you're like, ah, this is too slow.

49:27 But yeah, it's so like things like that are just so possible with dictionaries.

49:32 They make me happy.
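
The keyword search described above is commonly built as an inverted index: a dictionary from each keyword to the set of documents containing it. The sketch below shows that general idea only; it is not the actual Talk Python search code, and the document IDs and text are invented.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase word to the set of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return the IDs of documents containing every word in the query."""
    words = query.lower().split()
    if not words:
        return set()
    matches = [index.get(word, set()) for word in words]   # one lookup per word
    return set.intersection(*matches)


# Hypothetical documents standing in for transcripts and show notes.
documents = {
    "doc1": "binary search in python",
    "doc2": "hash tables and python dictionaries",
    "doc3": "graphs and shortest path search",
}
index = build_index(documents)
print(search(index, "python search"))
```

The expensive work (reading all the text) happens once, when the index is built. Each query afterwards is a handful of dictionary lookups and a set intersection.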

49:33 Cool.

49:34 So let's talk about some of the other algorithms.

49:37 And we're kind of getting short on time.

49:38 So maybe just sort of skip over and just touch on them a bit.

49:41 Sure.

49:41 Again, chapters five and six are super useful to me, you know, hash tables and graphs.

49:51 And a graph is this really simple idea where you model a problem using nodes and edges.

49:58 So my example is, you're trying to get from Twin Peaks to the Golden Gate Bridge.

50:04 And this is how you can tell that I live in San Francisco.

50:08 And you're trying to figure out what is the least number of bus transfers I have to do to get from Twin Peaks to the Golden Gate Bridge.

50:16 And so you can model that, you can model it using a graph where you have one node, which is the Golden Gate, which is Twin Peaks.

50:24 And then it kind of puts out edges, which are all the different bus routes you can take to the next part, to the next transfer stop.

50:33 And then that one puts out edges of all the buses you can take from that one to the next transfer stop and so on until you hit the Golden Gate Bridge.

50:43 And it's a, this is a classic, it's called the shortest path problem.
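
For an unweighted graph like this one, breadth-first search is a standard way to find the route with the fewest hops. The sketch below assumes a made-up set of transfer stops between Twin Peaks and the Golden Gate Bridge; only the two endpoints come from the conversation.

```python
from collections import deque


def fewest_hops(graph, start, goal):
    """Breadth-first search: return the minimum number of edges, or None."""
    queue = deque([(start, 0)])
    visited = {start}
    while queue:
        node, hops = queue.popleft()         # closest unexplored node first
        if node == goal:
            return hops
        for neighbor in graph.get(node, []):
            if neighbor not in visited:      # never queue a stop twice
                visited.add(neighbor)
                queue.append((neighbor, hops + 1))
    return None


# Hypothetical bus network: each node lists the stops reachable by one bus.
routes = {
    "twin_peaks": ["stop_a", "stop_b"],
    "stop_a": ["stop_c"],
    "stop_b": ["stop_c", "golden_gate_bridge"],
    "stop_c": ["golden_gate_bridge"],
}
print(fewest_hops(routes, "twin_peaks", "golden_gate_bridge"))
```

The same function would answer the "how many connections away" question if the dictionary mapped each person to a list of their friends.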

50:47 Another example would be, you're on Facebook and you're trying to find, you really want to talk to, I don't know, someone famous, Brad Pitt, for example.

50:59 And so you're trying to figure out what is the shortest number of connections to Brad Pitt?

51:04 Like, you know, what is the least number of people?

51:06 Which one of your friends could introduce you indirectly?

51:08 Something like this, right?

51:09 Exactly.

51:10 I mean, LinkedIn does this where they say, like, you're, you know, so many connections away from this person.

51:15 And it's just graphs.

51:17 It's a graph problem.

51:18 Nice.

51:18 Okay.

51:19 And then you talk about greedy algorithms and dynamic programming.

51:23 Yep.

51:24 And again, greedy algorithms are so simple.

51:26 It's just do the simplest thing you can do.

51:31 So, you know, my example is you are a thief.

51:36 This is the classic knapsack problem.

51:38 So, you're a thief in a department store and you have a knapsack and you're trying to figure out what items can I steal to get the maximum value, to steal the maximum value of items.

51:50 And different items have different values, but there's only so much space in your knapsack.

51:55 So, the greedy approach says, pick the most expensive item that will fit in your knapsack and put it in there and then steal the next most expensive item that will fit and keep going until you have filled your knapsack.

52:07 And it doesn't give you a perfect solution, but it gives you a good enough solution where it's good enough for most cases.
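
A sketch of the greedy strategy as described: take the most valuable item that still fits, then repeat. The items, values, weights, and capacity below are illustrative numbers chosen so the greedy answer is visibly good but not perfect.

```python
def greedy_knapsack(items, capacity):
    """items: (name, value, weight) tuples. Returns (chosen names, total value)."""
    chosen, total, remaining = [], 0, capacity
    # Consider the most valuable items first.
    for name, value, weight in sorted(items, key=lambda it: it[1], reverse=True):
        if weight <= remaining:          # take it only if it still fits
            chosen.append(name)
            total += value
            remaining -= weight
    return chosen, total


# Illustrative items: (name, value in dollars, weight in pounds).
items = [("stereo", 3000, 30), ("laptop", 2000, 20), ("guitar", 1500, 15)]
print(greedy_knapsack(items, 35))
```

With a 35-pound knapsack, greedy grabs the stereo and nothing else fits. The laptop and guitar together weigh 35 pounds and are worth more, so the greedy answer is close but not the best one.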

52:15 Yeah, this optimization problem.

52:16 Interesting.

52:18 Yeah.

52:18 And then recommendations with K nearest neighbors.

52:21 Oh, yes.

52:22 Again, really simple concepts.

52:25 My example is, let's say you are a Netflix user and Netflix is trying to recommend movies to you.

52:31 And they know that you have, I love the Matrix, for example.

52:35 So, they know that I've rated the Matrix five stars on Netflix and a bunch of other movies.

52:39 So, they look for other users that have similar ratings on those movies.

52:46 So, they might say, like, you rated the Matrix five stars.

52:50 It looks like Keanu Reeves rated the Matrix five stars also.

52:53 And then you rated, I don't know, 101 Dalmatians five stars.

53:00 And Keanu Reeves rated 101 Dalmatians five stars also.

53:03 So, it seems like the two of you have a common taste in movies.

53:08 We're just going to look at what other movies Keanu likes that you haven't seen.

53:13 And we'll just recommend them to you because you probably will like them also.
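
The recommendation idea above can be sketched as a tiny nearest-neighbors lookup. Everything in it is an assumption made for illustration: the `ratings` data is invented, `distance` and `recommend` are hypothetical helper names, and comparing users by Euclidean distance over only the movies they have both rated is one simple choice among many. It is not a description of Netflix's actual system.

```python
import math


def distance(ratings_a, ratings_b):
    """Euclidean distance over the movies both users have rated."""
    shared = set(ratings_a) & set(ratings_b)
    if not shared:
        return math.inf  # Nothing in common, so treat as infinitely far apart.
    return math.sqrt(sum((ratings_a[m] - ratings_b[m]) ** 2 for m in shared))


def recommend(ratings, user, k=1, min_rating=4):
    """Suggest unseen movies that the k most similar users rated highly."""
    others = [name for name in ratings if name != user]
    nearest = sorted(others, key=lambda name: distance(ratings[user], ratings[name]))[:k]
    seen = set(ratings[user])
    picks = []
    for name in nearest:
        for movie, stars in ratings[name].items():
            if movie not in seen and stars >= min_rating and movie not in picks:
                picks.append(movie)
    return picks


# Made-up star ratings (1 to 5) for illustration only.
ratings = {
    "you": {"The Matrix": 5, "101 Dalmatians": 5},
    "keanu": {"The Matrix": 5, "101 Dalmatians": 5, "Movie A": 5, "Movie B": 2},
    "someone_else": {"The Matrix": 1, "101 Dalmatians": 2, "Movie C": 5},
}

print(recommend(ratings, "you", k=1))
```

Here "keanu" has identical ratings on the shared movies, so he is the nearest neighbor, and the only movie he rated highly that "you" has not seen gets recommended. Raising `k` pulls in suggestions from less similar users as well.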

53:18 Yeah, that's cool.

53:19 That's a pretty simple recommendation engine.

53:21 But I recall a few years ago that Netflix had, like, a challenge to the community to build the best recommendation engine.

53:29 And they had, like, a million dollar prize or something big like that, right?

53:32 Do you remember?

53:33 Did you hear about this?

53:34 Yeah, I did.

53:35 And I think they used to use K-nearest neighbors.

53:39 I'm not 100% sure.

53:41 But I think they used K-nearest neighbors before that prize came out.

53:45 So, that's how, you know, it worked for them for so long.

53:49 And I think the current version uses a modified version of K-nearest neighbors.

53:54 Yeah, I don't see how you get around something like this being at least part of the solution, right?

53:58 Yeah.

53:59 Awesome.

54:00 I think that's quite a good introduction to algorithms.

54:03 You know, if you're out there listening and you didn't have a formal computer science education or, like me, you kind of paid attention and then forgot, and these ideas are living at the edge of your memory and you wouldn't mind a reminder,

54:16 I think this is a really interesting way to learn it with these nice illustrations and simple stories.

54:21 I really appreciate your book.

54:22 I think we're probably out of time, so I have to leave it there.

54:24 Let me ask you a couple of questions before I let you go, though.

54:28 I always ask my guests.

54:29 Sure.

54:29 Yeah, there's now over 90,000 PyPI packages out there, distinct packages.

54:34 I'm sure you've come across some that you found interesting that maybe not everybody knows of.

54:38 Anything come to mind you want to recommend?

54:41 Oh, my gosh.

54:41 I'm sure everyone knows about the one I'm going to recommend, but it's called NumPy.

54:47 Oh, yeah.

54:47 And it's, you know, I'm starting to get more into machine learning and it's so useful.

54:55 Yeah, I think things like NumPy and SciPy and the whole data science story have really opened up a whole new avenue for Python to grow, right?

55:07 It's not just a web development technology.

55:09 It's also so much for science and it's amazing what people are doing with it.

55:12 Editor, if you're going to write some Python code, what editor do you use?

55:17 I have to go with Vim.

55:18 All right, Vim, right on.

55:20 Okay, so any final call to action?

55:23 How do people find your book?

55:24 Things like that.

55:25 So if they just go to my website, it's adit.io, A-D-I-T dot I-O.

55:31 There's a link to my book and there's blog posts there.

55:36 Oh, you said that people can get the pictures and use them like for their classes if they're a teacher or something.

55:42 Yes, that's something not a lot of people know.

55:45 But all the images from the book are available for free online and high res.

55:52 So if you're a teacher and you want more images related to algorithms, there's like 400 images from this book and they're all on my GitHub.

56:03 So it's github.com/egonSchiele.

56:07 And maybe you can add a link so I don't have to spell that out.

56:11 Yeah, I'll definitely link to that.

56:12 No problem.

56:12 That'll be in the show notes.

56:13 All right.

56:14 Well, it's been great to talk to you and I definitely recommend your book to people.

56:19 I think it's very approachable.

56:21 So if this kind of thing is interesting to you, check it out.

56:24 Thank you so much.

56:24 You're welcome.

56:25 Thanks for being on the show.

56:26 Talk to you later.

56:26 Take care.

56:27 This has been another episode of Talk Python To Me.

56:31 Today's guest has been Adit Bhargava.

56:34 And this episode has been sponsored by Capital One and Intel.

56:37 Thank you both for supporting the show.

56:39 Are you a data scientist or Python developer who loves data?

56:43 If you're looking for a place to work on data science with truly big data that can affect millions of lives,

56:48 then head on over to jobs.capitalone.com/talkpython and check out the wide range of jobs that Capital One is trying to fill right now.

56:57 The Intel Distribution for Python delivers the high performance Intel C libraries built right into Python.

57:03 Get close to 100 times better performance for certain functions when using NumPy, SciPy, and scikit-learn.

57:09 Check them out at talkpython.fm/intel.

57:12 Are you or a colleague trying to learn Python?

57:15 Have you tried books and videos that just left you bored by covering topics point by point?

57:20 Well, check out my online course, Python Jumpstart by Building 10 Apps, at talkpython.fm/course to experience a more engaging way to learn Python.

57:29 And if you're looking for something a little more advanced, try my Write Pythonic Code course at talkpython.fm/pythonic.

57:37 You can find the links from this episode at talkpython.fm/episodes/show/82.

57:43 Be sure to subscribe to the show.

57:45 Open your favorite podcatcher and search for Python.

57:47 We should be right at the top.

57:48 You can also find the iTunes feed at /itunes, Google Play feed at /play,

57:54 and direct RSS feed at /rss on talkpython.fm.

57:58 Our theme music is Developers, Developers, Developers by Corey Smith, who goes by Smix.

58:03 Corey just recently started selling his tracks on iTunes, so I recommend you check it out at talkpython.fm/music.

58:09 You can browse his tracks he has for sale on iTunes and listen to the full-length version of the theme song.

58:15 This is your host, Michael Kennedy.

58:17 Thanks so much for listening.

58:18 I really appreciate it.

58:19 Smix, let's get out of here.

58:22 Stating with my voice, there's no norm that I can feel within.

58:26 Haven't been sleeping, I've been using lots of rest.

58:28 I'll pass the mic back to who rocked it best.

58:31 I'll pass the mic back to who rocked it best.

58:44 Thank you.