#455: Land Your First Data Job Transcript
00:00 Are you interested in data science, but you're not quite working in it yet?
00:03 In software, getting that very first job can truly be the hardest one to land.
00:08 On this episode, we have Avery Smith from Data Career Jumpstart here to share his advice
00:13 for getting your first data job. This is Talk Python to Me, episode 455, recorded January 18th,
00:20 2024.
00:21 Welcome to Talk Python to Me, a weekly podcast on Python. This is your host, Michael Kennedy.
00:40 Follow me on Mastodon, where I'm @mkennedy, and follow the podcast using @talkpython,
00:45 both on fosstodon.org. Keep up with the show and listen to over seven years of past episodes
00:51 at talkpython.fm. We've started streaming most of our episodes live on YouTube. Subscribe to our
00:57 YouTube channel over at talkpython.fm/youtube to get notified about upcoming shows and be part of
01:03 that episode. This episode is brought to you by Sentry. Don't let those errors go unnoticed.
01:09 Use Sentry like we do here at Talk Python. Sign up at talkpython.fm/sentry.
01:15 And it's brought to you by Posit Connect from the makers of Shiny. Publish, share, and deploy all
01:21 of your data projects that you're creating using Python. Streamlit, Dash, Shiny, Bokeh, FastAPI,
01:27 Flask, Quarto, Reports, Dashboards, and APIs. Posit Connect supports all of them. Try Posit Connect
01:34 for free by going to talkpython.fm/Posit, P-O-S-I-T.
01:39 Hey folks, before we jump in and talk about data science jobs and careers, I want to tell you
01:44 really quickly about some awesome news. Back in February, I gave the keynote at PyCon Philippines.
01:50 It was entitled The State of Python in 2024. Well, that is now out on YouTube. The team at
01:58 PyCon Philippines did a great job. The video came out great. If you want to check out The
02:02 State of Python in 2024, according to me, just click on the link in the show notes to watch
02:07 it over on YouTube. Now let's talk to Avery. Avery, welcome to Talk Python to Me.
02:11 Thanks so much. I'm so excited to be here and be part of the show.
02:15 I'm excited to have you here as well. You know, one of the things that people reach out to me often
02:19 is how do you get into data science? How do you get into programming? How do you get into Python?
02:26 You know, I've been trying, or maybe they got a degree or they took some training program,
02:32 bootcamp or something. And going from zero to one, I think is the biggest career step you have to make.
02:40 That next job and the one after that, it only gets to be smaller steps, not bigger steps.
02:45 And it's really tough because that first big step, you're brand new at it. You have no experience,
02:49 right? It's your first data science job or your first programming job. And so hopefully we can
02:54 give some folks out there a little bit of a hand up to help them make that jump.
02:59 Yeah, totally. I like to show this graphic that says, it's a circle and it's a circle of text. And it says,
03:05 I can't get a job because I don't have experience because, and then it restarts,
03:11 I can't get a job. And that's the tricky part. It's like, how do you get a data science job when
03:15 you have no data science experience? Because to get data science experience, that seems like you have
03:20 to have a job as the prerequisite and vice versa. So it is very tricky. So happy to chime in on that
03:25 today.
03:26 The industry can take it too far. They can take it way too far. So a few years ago, there was a
03:32 really funny tweet that went around back when they call them tweets. I don't know what they're called
03:36 anymore. Sebastian Ramirez, the guy who created FastAPI, saw a job posting. When FastAPI was like a
03:44 year and a half old, it said, you must have four years of experience with FastAPI to apply. He said,
03:50 hey, look, I'm the creator of FastAPI and I'm unqualified for this job. What kind of world are
03:55 we living in?
03:56 Yeah. I don't want to live in that world, but that's unfortunately where we're at. That's so tough.
04:01 And it's hilarious. These job descriptions are getting out of hand. That's for sure.
04:06 Yeah. Well, with AI, it's probably not going to get better.
04:08 We could talk about that more later. But before we get into that, let's just jump in with a little
04:13 bit of background on you before we get to the topic. Tell us a bit about yourself. What do you do?
04:18 How'd you get into Python? Things like that.
04:20 Yeah, absolutely. So I'm currently a data science consultant and also a data science instructor.
04:27 I run some online programs where I teach people to become data analysts mostly is what I'm focused on.
04:34 But I also have this practice where I help companies solve data problems with different techniques.
04:39 I started actually by studying chemical engineering in college in my undergraduate degree.
04:45 And about a semester in, I realized, crap, I hate this. This is not for me.
04:50 But I was a little on a little of a tough. Yeah. Do you agree? Have you felt something similar?
04:54 I did a semester of chemical engineering as well. I thought, I love chemistry. I love math. Put them
04:59 together. Somehow they don't go together. It's like ice cream and eggs or something. No,
05:04 they don't go together for me at least.
05:06 Yeah. It wasn't good for me either. I was just like, oh man, I'm actually not interested
05:10 in refineries or like manufacturing. But I, like you, liked chemistry. I liked math. I thought this
05:16 is perfect. But I quickly realized, oh man, I really liked this whole programming part that I get to do
05:21 in MATLAB at the time when I was an undergrad. And I was on a time crunch to get through college kind of
05:27 quickly through eight semesters. And the other issue I had was I didn't know what to do instead. It was
05:33 like, I don't really want to study computer science. Part of the reason why is they kind of
05:37 had this weed out course at the beginning, which you had to build Excel from scratch, basically like
05:43 some sort of a spreadsheeting tool. And I was like, why would I rebuild something that already exists
05:48 that I don't even like using in the first place? I wasn't really into it. So I didn't, I didn't know
05:52 what to do. And luckily I was working as a lab technician at this company, the really cool company
05:58 that makes the sensors that basically have the ability to smell. So they can sniff what's in
06:04 the air and it has applications for finding drugs or bombs and airports and stuff like that.
06:09 And there was a data scientist on staff and that data scientist was awesome. He was like showing me
06:14 all these cool algorithms he was writing for these sensors. And then one day he got up and left and he
06:20 left the company and we tried to hire another data scientist for like six months, but they were really
06:26 expensive. We were a small company and none of them really wanted to move to Utah where I lived in
06:30 Salt Lake City. And so we couldn't, we couldn't really find someone that would be able to do it.
06:33 And so finally I was like, well, I really liked this programming stuff. And I, you know, the data
06:38 scientist showed me a thing or two, maybe I could take a stab at this. And I started, I wrote like my
06:42 first machine learning algorithm and I was like, oh my gosh, I'm addicted to this. And then I never
06:46 looked back and had been data science since basically.
06:49 What a great story. Yeah. I think, I think a lot of people fall into programming that way. And for some reason,
06:55 not unexpectedly, but for some reason, a lot of people fall into Python that way as well. They're
07:00 like, you know, I have a job and I got this thing I got to do. I just need a little bit more than maybe
07:05 like an Excel spreadsheet or something and put it together. And you're like, actually, this is cool.
07:10 After a while, like, this is cooler than what I've been doing, or maybe I'll make it a good part of what
07:14 I do. Right.
07:15 Yeah. A hundred percent. Even just making, it was in MATLAB, which is basically engineers version
07:20 of Python or college version of Python 10 years ago. Right. And I made like tic-tac-toe and I remember
07:26 playing tic-tac-toe against the computer. I think that's what it was. Or maybe it was,
07:29 maybe it was Hangman. I can't remember. But I remember like the idea of like being able to play,
07:33 to program games and play against the computer. And I built it. I was like, this is the coolest thing
07:38 ever. I got to, I got to do more of this.
07:40 Absolutely. You know, I think I've done some MATLAB too, when I was younger and it's not that
07:44 different from Python, but it's, I think one of the big differences other than it just being like
07:49 embedded in a big expensive app is it's not a general purpose programming language, right? You
07:54 wouldn't go, you know, that was fun, but let me go build this website in MATLAB or let me create
08:00 Airbnb and MATLAB or, you know, like there's, you just don't want to sort, Azure has this like
08:06 self-prescribed limit to what you can do with it.
08:08 That's one of the coolest parts about Python is it's really a Swiss army knife and you can pretty
08:13 much do, I don't want to say anything, but pretty close to anything in Python, which makes it really
08:19 neat. And obviously one of the huge limitations of MATLAB is one, it costs thousands of dollars,
08:23 but two, you're right. It's not going to do cybersecurity for you. It's not going to build
08:27 websites, but the syntax at the end of the day was, was really quick. It was, it was easy for me to
08:32 transition from MATLAB to Python because the syntax isn't all that different.
08:35 No, it's not all that different. More math focused, but pretty similar. So I think maybe that's a good
08:41 place to start discussing and exploring the topic of your first data science job. And
08:46 wouldn't necessarily plan on starting here, but let's, let's start with before you even necessarily know
08:50 programming language, right? Maybe you've dabbled in MATLAB or you've dabbled in Excel or even dabbled in,
08:57 I don't know, JavaScript or something. This thing we've been talking about with MATLAB and it applies to other areas as well,
09:02 like through programming languages per se, like Julia or something like that, is how,
09:08 if you invest your time into learning one of these things really well, like how broadly industry-wide
09:16 of a skill, high demand skill is that going to be, right? If you learn MATLAB, you put yourself in a box,
09:21 you learn a more general programming language, you kind of have more options afterwards, right?
09:25 Yeah, totally. I think like the more broad of a language you learn, the more useful you are to,
09:31 to more industries in general. But I might take that even a step further and just say, you know,
09:37 learning MATLAB, not a whole lot of companies use MATLAB, but just like landing your first data job,
09:43 going from zero to one is the hardest, learning your first language, zero to one is the hardest as well.
09:48 And then once you have that first language, the next language becomes so much easier. So
09:52 one of the first things I learned was MATLAB. And then I moved to Python and that was easier. And then
09:57 I learned SQL and then I learned R and then I learned JavaScript. And every time I added like a new tool
10:02 to my toolkit, it was quite, not almost, it was easy, but it got easier with each one. I think that's true
10:07 with foreign languages as well. Once you learn one foreign language, then the third and the fourth become
10:12 quite easy. At least that's, that's what I heard. I speak kind of through two and a half languages,
10:17 but like I, there's people who speak like seven and they always say like the sixth and the seventh
10:21 become easier.
10:22 Yeah. You wonder how could you probably, because learning the first one is so hard,
10:25 first foreign language. So you're like, well, how could you possibly take that on for this many
10:29 languages? And it's that it's not the same challenge each time, right?
10:32 Yeah, exactly.
10:32 Yeah. So I think when people are considering getting into data science, they really want to consider
10:38 what language they choose and where they go. Like you're coming out of a
10:42 college program. You might feel like MATLAB or something like that's real popular. And yet
10:47 that's because it's popular amongst professors who forced their students to do it. That doesn't
10:51 necessarily mean that's the world, the broad worldview. What do you think about R? You know, both.
10:57 I like R. I'm not, I sometimes troll R on LinkedIn. So I guess that's another thing I should say is I post
11:03 a lot on LinkedIn, kind of a LinkedIn guy. And so a lot of the times, honestly, just for jokes and kicks
11:08 and giggles, I'll kind of roast R on LinkedIn just to get the trolls angry in the comments.
11:13 I've invented it. It's quite fun. It's quite a fun experience, but I'm not that big of a hater. I
11:18 think that's really interesting about R versus Python is obviously a big debate in the data
11:23 science community is R is kind of that does one thing really well. And it's getting a little less
11:29 of that as like more packages and libraries are added to R, but R does the statistics and machine
11:34 learning very well. But obviously I don't think once again, I don't know any websites, any like
11:39 super functioning websites that are built on R. I don't know any cybersecurity that's really done
11:44 done, done VR. So I think R does what it does well. The syntax sometimes is a lot easier for
11:49 people to go from Excel, which a lot of people are more familiar with in the finance or banking world,
11:54 for example. The syntax in R is a little bit more similar to those Excel formulas than it is to Python.
12:00 So I think sometimes people have a little bit more success just because, oh, this kind of feels like
12:04 our formulas are sorry. This feels like Excel formulas. And so people really get there. I think what
12:10 you're kind of alluding to is if you're going to learn one skill, you might as well learn the one
12:14 skill that's applicable to the most, the widest net, right? And so that way you're fishing in the
12:20 biggest lake you possibly could versus in a smaller pond of R. I think that's worth looking at. And one
12:27 of the things I actually really enjoy doing, because you know, you mentioned, oh, you might think MATLAB
12:31 is popular because that's what the professors taught you. And there's actually not a whole lot of
12:35 data out there about, well, what should you learn? So I don't know if you know who Luke Bruce is. He's a data
12:41 analyst YouTuber. I was going to say YouTuber on YouTube, but that's kind of redundant on YouTube. And
12:45 one of the things he's done is he's actually built this tool where he's web scraping thousands of jobs,
12:50 different data jobs every week, and then displaying and analyzing the skills required for those jobs.
12:56 So it's actually like a data driven way of saying, if you want to be a data scientist, what skills should you
13:01 actually be focusing on as you go, as opposed to just listening to what a professor will say,
13:07 or what a LinkedIn influencer will say, or what your bootcamp will say. Like actually getting some
13:13 data on, I think is pretty neat. That is super cool. And I'm not familiar with Luke. So we're going to dig
13:19 him up and put him in the show notes for later so people can check that out. For sure. Do you remember
13:23 any of the trends you've recently talked about? It's datanerd.tech, I think is the website there.
13:27 I look at it mostly for data analysts because that's who I work with the most. So I know the
13:33 data analyst data very well. SQL is number one at 50%. I think Python is number two at like 30%.
13:40 I think Python might've jumped it. Well, this is for all data positions right here. So the job title,
13:47 you can choose. So which one do you think I should pick here? Data?
13:51 Maybe data scientist. Data scientist. Yeah. Right.
13:53 What's that? Yeah, you're right. Wow.
13:56 Whoa, Python 69%. Look at that. That's huge. So like, that's even, that's even what? 20% more than SQL,
14:04 which a lot of people are like, if you were going to be a data scientist, you have to know SQL.
14:07 Yeah. If you look at the job descriptions, Python's mentioned a lot more. So if you're going to learn,
14:11 if you're brand new and you're going to learn one, you might as well start with Python. Because that's
14:15 probably the most in demand skill that there is right now for a data scientist.
14:18 Yeah. And it's pretty easy, right? It's not like, well, why don't you just learn C++ for
14:22 embedded devices? You're like, you know what? Maybe I'll pick something else to start with. Right. But
14:26 you know, Python's pretty easy. I agree with you. I think Python's great. I actually think,
14:31 I think SQL is probably easier to learn if I'm being honest, because really, especially for like
14:35 data science stuff, there's only about like 20 commands that you need to know in SQL. But it's,
14:40 once again, SQL's a lot more, there's no websites built on SQL. I'll tell you that much. So
14:44 it's a lot more limited on what it can do.
14:46 It's a skill, but not the language. It's not enough on its own, generally. I mean,
14:52 you can do reports and quite a bit with it. But you know, it's like, when you see these programming
14:57 popularity, like what's the most popular language? Oh, look, CSS is the third most popular. That's not
15:02 a language. That's a thing that you use with other languages, right? Like use it with all the other
15:06 languages. That's why it's high up. But that doesn't mean it's high in demand. Exactly. It's just like
15:11 table stakes, you know? Yeah.
15:12 So you kind of got to distinguish table stakes from like picking an area, I think.
15:17 That's totally true. And really, I think Pythonistas could make the argument that there's
15:22 really nothing in SQL that you couldn't do in Python. That's a little somewhat true,
15:28 true depending on data size and stuff like that. But regardless, there is ways that you can do
15:32 most of the SQL commands in Python one way or another. Yeah. Yeah.
15:36 It could be when I first became a data scientist, I didn't even know SQL and I was doing SQL commands
15:41 or I was doing the aggregations or the where functions or the window functions using Python.
15:47 So you definitely can. As long as your data is not like super big, then you'll totally be fine.
15:51 Right. Like some kind of generator or even slices or yeah, things like that, right? List comprehensions,
15:57 set comprehensions, all that kind of stuff. Kind of like, gosh, I really wish, a little bit of a
16:02 sidebar, but I wish like list comprehensions and all those things had just a few more SQL features,
16:08 right? Like in a list comprehension, I say, give me this thing, maybe give me this property of this class
16:14 modified, like give me the user's name, uppercase. Right. So that's like select. And then for thing
16:20 in collection, that's like from table or whatever. Right. And then you have the where clause with the
16:27 if statement, but boy, wouldn't it be cool to have like a sort also in there and other things like that,
16:33 you know? Oh, well, totally. It's so close. The cool thing is, is if you want that sort,
16:38 it's what one extra line. Like it's, it's not, it's not too bad. So it, Python, I mean,
16:43 I don't want to say this necessarily to hate all, to make all the data scientists and SQL lovers
16:47 mad, but, but really Python can do a lot of the things that SQL that's for sure.
16:52 Yeah, that's for sure.
16:53 This portion of talk Python to me is brought to you by Sentry code breaks. It's a fact of life with Sentry.
17:00 You can fix it faster. As I've told you all before, we use Sentry on many of our apps and APIs here at
17:06 Talk Python. I recently used Sentry to help me track down one of the weirdest bugs I've run into in a long
17:12 time. Here's what happened. When signing up for our mailing list, it would crash under a non-common execution
17:19 past, like situations where someone was already subscribed or entered an invalid email address or
17:25 something like this. The bizarre part was that our logging of that unusual condition itself was crashing.
17:32 How is it possible for her log to crash? It's basically a glorified print statement. Well, Sentry to the
17:39 rescue. I'm looking at the crash report right now, and I see way more information than you'd expect to
17:44 find in any log statement. And because it's production, debuggers are out of the question.
17:48 I see the traceback, of course, but also the browser version, client OS, server OS, server OS version,
17:56 whether it's production or Q and A, the email and name of the person signing up. That's the person who
18:01 actually experienced the crash. Dictionaries of data on the call stack and so much more. What was the
18:06 problem? I initialized the logger with the string info for the level rather than the enumeration dot info,
18:14 which was an integer based enum. So the logging statement would crash saying that I could not use
18:20 less than or equal to between strings and ints. Crazy town. But with Sentry, I captured it, fixed it,
18:27 and I even helped the user who experienced that crash. Don't fly blind. Fix code faster with Sentry.
18:33 Create your Sentry account now at talkpython.fm/sentry. And if you sign up with the code
18:39 TALKPYTHON, all capital, no spaces, it's good for two free months of Sentry's business plan,
18:46 which will give you up to 20 times as many monthly events as well as other features.
18:51 Probably the biggest, it's a bit of a diversion, but the biggest similarity to that I've seen in
18:57 the languages is C#'s link where they actually have almost all the query operators, including
19:04 joins and stuff like that built into the programming language. I'd love to see more of that kind of
19:09 inspiration into Python, but you know, that's all right. It's still really good. I've got a lot of
19:13 cool SQL-like features, but you're right. Once you are no longer working with data and memory,
19:17 or you want indexes, right? Like this concept of indexes is not sufficiently well understood. I think
19:24 every time I hit a website that takes five seconds to load, I'm like, somebody is not doing all the
19:28 things they should be doing. I just know it. That's totally true. What about SQL? You know,
19:34 let's talk about that for a bit, right? The SQL, the query language or databases and other things,
19:40 there's ways to SQL query, not just relational databases. But you said you got away with not
19:47 quite learning that, but do you think if you could start over, maybe making an effort to learn that
19:52 would be really valuable? Like how, how important is this a thing in the beginning of your career?
19:56 The interesting thing, you know, about landing a data job is your skills only plays, I say,
20:03 a third of the role. Your portfolio or the way that you portray your skills and your network,
20:09 I think are the other two thirds and they're actually more important than your skills. And that's
20:12 kind of how I got away with not knowing SQL and not even being, to be honest, that good at
20:17 Python at the time was because I used my network to be in the situation to get my lab technician job
20:24 in the first place. And then once again, I use that same network, in this case, my coworkers,
20:28 to land that first data scientist position after we couldn't hire anyone. And if I would have been
20:33 applying externally for that role, chances are I wouldn't have gotten that role. I probably didn't
20:39 know enough at the time to land that type of a role, but because they knew I was hardworking,
20:43 they knew I wasn't like a total idiot and I really liked to learn. They took that chance on me. It
20:49 paid off really well for them because at the time I was still in college. And so I wasn't getting paid
20:53 that much. And I was getting, I was not getting paid like a data scientist, but I was getting results like
20:58 a data scientist for them. So I think it was, it paid off for both of us. But I think if that was an
21:03 external job and I applied for it, I probably didn't have enough skills for it. So I definitely think
21:07 learning SQL, if you want to land data science job, isn't a bad place to start, especially because,
21:12 like I said, there, I mean, any programming language, I like to think of like the iceberg,
21:17 kind of like the Titanic, right? There's the parts that you see, and then there's the parts that you,
21:21 that you don't even know that you, that are there. And, and really you could spend the rest of your life
21:26 trying to master SQL or the rest of your life trying to learn Python. But the cool thing is,
21:31 is a lot of the time you only need that top little bit that's sitting at the,
21:34 the top of the surface of the water to actually get stuff done. And so for SQL, I think that's like
21:40 20 commands. And I think you could learn it honestly in like a month, you could learn those,
21:45 those 20 commands pretty easily, but it worked out for me. And I, I didn't have to use it that much
21:49 at the time until I was probably about almost three years into my job. And I actually had switched jobs
21:54 to a bigger company. The other thing that I was working for a smaller company where we didn't have a ton
21:58 of data. So we could use CSVs kind of as our, our database, which is not great practice. But when I,
22:05 when I eventually became a data scientist at Exxon mobile, I was going to say they didn't use Excel as
22:09 a database, but they still did. But the point is they had much larger SQL databases with hundreds of
22:14 thousands, actually millions of rows of data that I had to query.
22:17 Yeah. Then you gotta be really, you need to understand it at a much deeper level. You're
22:22 like, if you do a query like this, it's going to be super slow. But if you do it like that,
22:26 it can use the composite index for the sort and then blah, blah, blah, blah. All right. Then
22:30 you're getting to the bottom of the, the iceberg in SQL, or maybe not the bottom,
22:33 maybe like the middle chunk under the water, but there's so much to learn for both of them.
22:37 Amir at the audience asks, you know, like when you talk data job, like what kind of jobs are out
22:42 there? Right. So we talked to both about how we did chemical engineering and then we saw like
22:46 chemical factories, like, yeah, I don't really want to work here anymore. I'm out.
22:49 So thinking about like, well, what are the kinds of jobs you do? I think that's really important because
22:55 it's easy to get focused in on like the FANG companies. Like I want to work for like some super
23:02 big tech company. I want to move to San Francisco and like that, that, that, right. Like there's not just
23:07 plenty of other jobs, but the opportunities, just like you described, and as well as
23:12 like my first job, I worked at a company that had like eight people and it was awesome. Right.
23:17 They didn't expect me to be, you know, running Kubernetes clusters and doing all sorts of great.
23:22 They're just like, I need you to make this thing happen. Can you do like, I'm pretty new,
23:26 but that thing I can make that happen. Like, let's go. Right. And I feel like the, the possibilities
23:30 to get in, especially with these maybe more niche type of industries and companies might even be easier
23:37 for a first job. People seem to be really obsessed with, with the FANG. And I don't know if that's
23:41 like a societal thing, or if it's just, those are the companies that we use a lot. And so we're
23:46 excited about them, but yeah, there's so many more data jobs outside of FANG than there are inside of
23:51 FANG, even though there's, there's quite a bit inside of FANG. And oftentimes those roles can be
23:57 much more interesting and you can do a lot bigger of an impact. When, when I was working at the small
24:02 company, VaporSense, I like, I had so much power. I didn't even realize it. I had such a big effect on the
24:08 company. I was presenting to, you know, Fortune 500 companies and what I did really made a difference.
24:14 And when it came to the point where ExxonMobil offered me to go be a data scientist for Exxon,
24:20 I said, Oh, I want to go work for the big company with the nice desk and the nice laptop and, you know,
24:26 try something new. And when I got there, I really, I had some pretty cool opportunities when I was at
24:31 ExxonMobil, but ultimately I left pretty shortly after two years of being there because I just felt like a
24:36 cog in the machine and I didn't feel like I was actually making a difference. And that was really
24:40 important to my work satisfaction of like, is what I'm doing being used? Is it being used to better the
24:45 world? Do I feel valued? And the answer was kind of no for me when I was there. So there's definitely
24:50 a trade-off between the small companies and the big companies, but also to go back to your original
24:54 question, there's so many freaking roles in the data world that you're not even thinking of that. Like,
24:59 I'm not even thinking of, I saw a new one the other day when I was helping one of my students.
25:04 It was like, it wasn't data janitor, but it was something like that where I was like, I don't
25:08 even know what that role is, but there's, there's so many roles. When I was, when I was a data scientist,
25:12 VaporSense, the small company, my actual title was junior chemometrician, which basically means
25:19 you're doing data science with chemistry. When I was at ExxonMobil, when I was first there,
25:24 I was doing data science, but my actual title was optimization engineer. And so there's so many
25:29 titles that we don't even think to search of, or even to look up, but those are all data science
25:33 roles. I was doing machine learning every day in both those roles. And you would maybe never guess
25:37 from those titles. Yeah. You would never guess. No, that's awesome. What machine learning libraries,
25:42 frameworks were you using? At VaporSense, once again, because it's a smaller company,
25:46 I had a lot more say in what I was doing. We were building a bunch of machine, we were building
25:51 classification models to basically to take the data from our sensors and sniff if something was in the
25:56 air. Sometimes that was a yes, no, like, oh yes, there is ammonia in the semiconductor factory and
26:02 that's bad. So that's a yes classification kind of binary, right? Other times it was, what drug is this?
26:09 Is this meth or is this heroin? One of the use cases we had was, this is binary once again, but is this
26:15 recreational marijuana or medicinal marijuana? And can we tell the difference between, between those?
26:20 So we are usually using classification models, usually built in scikit-learn in Python, the majority of the
26:26 time there. When I was at Exxon, we had a lot less say, like the data scientists had a lot less say in
26:32 the decision making process. We were doing a lot of multivariate linear regression with a lot of crazy
26:39 hacks and transformations kind of in the meantime for one of my positions there. And then the other time,
26:44 the other position I did there, we were doing a lot of auto ML using PyCaret and letting it kind of
26:50 decide what type of models to do. So. Okay. The unsupervised learning type stuff, huh?
26:54 It was awesome. It was really fun to, to, I love PyCaret because it's like, okay, go make 25 models and
27:00 tell me which one's the best. It's like, takes, makes my job easy, I guess.
27:03 We're going to be creative with sheer numbers. That's how we're going to come up with a solution.
27:08 Got it. Exactly.
27:09 Well, Diego is asking like, what are some of the common stats methods as in mathematical type stuff
27:16 you would use? So one of the things I know that some people getting into programming think is you've
27:22 got to be really good at math to be a programmer. I think you've got to be really good at logical
27:26 thinking, but you need to almost zero math be like a web developer. You know, we're talking percents
27:33 for CSS, incrementing numbers from one to two to two to three for IDs and stuff like that. But for
27:40 data science, maybe there's a little bit more like, where do you see that kind of background?
27:45 I like what you said, you have to think logically, but maybe the math isn't as important. And I think
27:49 it's actually somewhat similar in data science. I will say you probably need a little bit more math
27:54 than a web developer, but I think it's a lot less than most people think. And it's probably less
27:59 about being able to do the math and maybe more about understanding the mathematical concepts.
28:04 And what I mean by that is a lot of, a lot of, so I also have a master's degree in data analytics.
28:10 A lot of master's degrees in data science and data analytics will say you need calculus and linear
28:15 algebra as kind of a background for your math. And that kind of stops people. I don't want to do any
28:19 calculus. I don't want to do any linear algebra. And while both those concepts do exist in data
28:24 science principles, the majority of the time, the computer, Python is doing the math. You just have
28:30 to be able to interpret the results of the math and kind of know what different directions, like this is
28:36 going down, an optimization problem, you know, okay, that's the derivative, you know, getting closer to
28:40 zero. Like it's really less about knowing how to do the math by hand and more just understanding what the
28:46 math the computer is actually doing. So I think it's actually a lot easier than most people say.
28:50 That being said, knowing how to do a derivative or taking integral, those concepts, I think is
28:55 probably underlying pretty important. But other than that, like a lot of the times I'm doing linear
29:00 regression because it's, it's awesome. It gets the job done. A lot of the time I'm doing hypothesis
29:06 testing and statistics, which you have to look like at a P score, nothing all that crazy. At Exxon,
29:12 I had to do a lot of linear programming, but that's honestly, that's like the exception versus the rule.
29:17 There's not a whole lot of linear programming for most data science, most data scientists. So
29:22 I really don't think the math is, is all that hard. Now, of course, that's coming from someone who
29:27 got a chemical engineering degree, who had to take all the calculus, all the linear algebra. So I did go
29:32 through those courses. I haven't really done it from scratch from like a lot of my students are teachers,
29:37 for example, who never took those courses in college. So I can't speak from that perspective,
29:41 but a lot of my students are able to figure it out at the end of the day and transfer. So it happens.
29:45 Yeah, yeah, for sure. I think there, you make a good point. I think it's about knowing,
29:50 okay, this formula or this algorithm or this test means this thing. It applies in this situation.
29:56 It doesn't apply in that situation. Here's what you're trying to get from it, right? Like,
30:00 I know I need to do a fast Fourier transform. So, and this is what it tells me when I get out the other
30:06 side. But do I need to be able to sit down and recreate the integral and the calculus behind it
30:14 and do that on like a home, like as a homework example, like, give me a function and I'll do the
30:18 Fourier transform and I'll actually do the symbolic integration. Like, no, you probably don't need that,
30:23 right? But you need to know, I do the Fourier transform in this situation and this is why. And then I just say,
30:30 call the function, do it, right? And interpret the results. Really, that's what being a data
30:35 scientist is all about is, yeah, what does the business use case, what's the desired business
30:40 use case? How do I relate that use case to the data? What technique can I use to get the outcome
30:45 that I need? Computer, go do it. Interpret results, present to stakeholders. That's a data scientist,
30:52 right? I think one of the challenges with that is going to be, not that it's not good, but I think
30:57 it's going to be challenging because how do you learn when to use a certain statistical test or when to do
31:04 some kind of funky transformation, like a Fourier transform without more traditional mathematical
31:10 backgrounds? And all the academics will not just go, oh, we're just going to give you like
31:14 five minute overview and they'll help you understand. They're like, nope, we're going to start with this
31:19 axiom or this theorem from differential equations. I'm going to work up. You're like, no, no, no, no,
31:24 no, I don't need that. I don't, I'm not on a four-year plan. I'm on a four-week plan. How do I,
31:30 how do I get value from a couple of the mathematical things without being sucked into like, yeah, now I'm in
31:36 differential equations at Harvard online and I don't understand how I got there.
31:39 It's such a big problem and I'm so glad you brought this up and I'll be vulnerable because yeah, I felt
31:45 the same, the same way. And I was like, there has to be a better way. And so about, what was it? Three
31:49 years ago now, two and a half years ago, three years ago, I said, oh my gosh, I'm going to solve this
31:53 problem and I'm going to start my own data science bootcamp. And so I spent about six months making the
31:58 curriculum, making all the videos. I opened it up. I got some students in there and I ran it for about
32:03 six months and I looked at the results and man, we weren't getting anyone into data science jobs.
32:08 And I thought, ah, what the heck am I doing wrong? I had this brilliant idea of like, we're going to be
32:12 less theory, more project, more hands-on. And I realized, man, the truth is people just learn better
32:19 at work. That's where you learn that whole technique that you just like, how does someone learn that?
32:24 The answer is by getting experience and learning it at work. And when I looked back and I said, okay,
32:28 well, we have had students get jobs. What jobs did they get? And it turns out most of them were
32:33 getting like business intelligence, intelligence engineer jobs or data analysts or financial
32:38 analyst jobs that were a little bit below a data scientist job. And I realized, oh man,
32:43 if we can just help people go from zero to one and get their foot in the door, they can go from one to
32:49 five much quicker at work because work is just, I don't know, it's this magical place, right? Like,
32:54 like you said, they, whatever you were working at earlier, and they're like, hey, can you do this
32:58 Kubernetes thing? They just kind of throw you in the fire and you're like, figure it out. And that's
33:03 somehow you do, I don't know what it is about work, but you figure it out and that's where you learn.
33:07 So that's kind of what I've, why I changed my curriculum to be more focused on, you know, okay,
33:11 maybe people aren't going to become data scientists, but can we get them to zero to one quickly?
33:15 And then they can get paid to learn the rest of the data science stuff when they're actually in that
33:19 first position. How much do you know about what you actually want to do in the industry
33:23 before you've done it as well? Right? Like, you're like, oh, I thought everybody said machine
33:28 learning was awesome. And I've used chat CPT and I loved it, but it turns out actually like API is
33:34 better, but I've never had a chance to build an API. So until I started, I didn't even learn that one,
33:39 it was a thing to that. It was cool or vice versa, right? Whatever. But until you get kind of in,
33:45 you don't even know, like, actually this part is where I really am enjoying it. And so just getting that
33:50 first step, that's a big deal. A hundred percent. You don't know what you don't know until you know
33:55 it. That's why, I mean, really when it comes to, if we like, we go back to just SQL or just Python,
34:00 you could spend, I tell people this, if you tried to master Python before you applied to a job,
34:06 you'd be like 80 years old before you ever applied to a job. Same with SQL, same with machine learning.
34:12 The cool thing about data is we're never going to know it all. And so just learn the bare minimum to get
34:17 your foot in the door. And then you have this place where you're going to get paid to learn what
34:21 you want to learn. Eventually, if you learn, oh, I love APIs. I promise you that there's a company out
34:26 there that will hire you and you can learn APIs on the job. Like that's going to happen. But that first
34:31 step is no true. There's a company out there that it doesn't know it needs APIs, but you could help them.
34:36 And you know, they don't have huge expectations because this is the thing they just learned they needed.
34:40 Right. A hundred percent. Yeah. It's wild, right?
34:42 This portion of Talk Python to Me is brought to you by Posit, the makers of Shiny, formerly RStudio,
34:50 and especially Shiny for Python. Let me ask you a question. Are you building awesome things? Of
34:56 course you are. You're a developer or data scientist. That's what we do. And you should check out Posit
35:00 Connect. Posit Connect is a way for you to publish, share, and deploy all the data products that you're
35:06 building using Python. People ask me the same question all the time. Michael, I have some cool
35:12 data science project or notebook that I built. How do I share it with my users, stakeholders, teammates?
35:18 Do I need to learn FastAPI or Flask or maybe Vue or ReactJS? Hold on now. Those are cool technologies,
35:25 and I'm sure you'd benefit from them, but maybe stay focused on the data project. Let Posit Connect handle
35:30 that side of things. With Posit Connect, you can rapidly and securely deploy the things you build in Python.
35:35 Posit Connect, Streamlit, Dash, Shiny, Bokeh, FastAPI, Flask, Quarto, ports, dashboards, and APIs.
35:42 Posit Connect supports all of them. And Posit Connect comes with all the bells and whistles to satisfy
35:48 IT and other enterprise requirements. Make deployment the easiest step in your workflow with Posit Connect.
35:54 For a limited time, you can try Posit Connect for free for three months by going to talkpython.fm/posit.
36:01 That's talkpython.fm/posit. The link is in your podcast player show notes. Thank you to the team at
36:08 Posit for supporting Talk Python.
36:13 Posit Connect: Let's talk about some career advice. I mean, I know you talked about being
36:18 connected on LinkedIn pretty well and certainly having some kind of social network is important. And they
36:23 maybe, it's not that you would call it not social, but a real world network of actual human beings that
36:29 you're, you know, physically know somehow. Posit Connect: What's that? I don't know what that is.
36:32 Posit Connect: I know. Like, we gave that up back in 2020, I thought.
36:35 Posit Connect: Yeah. Posit Connect: Anyway, like, there was some stat that I saw somewhere that,
36:38 you know, over half of the jobs are filled filled before even becomes a job posting, right? Maybe
36:44 some of the best ones is like, hey, who knows somebody who can do this? We need some, like your
36:49 data science example, data scientist example. They quit like, oh, we need somebody. Does anybody know
36:54 good data science? I don't want to just go put it out on the open job market and have to have a hundred
36:58 interviews and who knows what I'm going to get. Like, if you can recommend somebody, let's start there,
37:03 right? So being in that group to be recommended, it's important.
37:07 Posit Connect: It's the key. There was a really interesting survey done on LinkedIn and they said,
37:12 it was kind of, it was done by the same person and Jordan Nelson, by the way, he said, "How do you
37:16 approach getting a job?" And then the next day he said, "How did you get your last job?" And 80% of
37:22 people, they use what I call the spray and pray method, which basically means you go and you apply to as
37:27 many jobs as you possibly can and hope for the best. Cross your fingers. That was 80% of what
37:32 people were doing. And then on the next poll, the next day, it was a total, I think of what, 70% were
37:38 either headhunted, recruited or referred. And so it's like the Pareto principle here where, you know,
37:43 80% of the effort is only getting you 20% of the results. And really 20% of the effort gets 80% of the
37:50 results. So it's okay. We know networking and getting recruited is really important, but how do we do it?
37:55 It's easier said than done. And like you said- In the industry, how do I make friends who are,
37:59 right? It's like, well, my neighbors don't do it. So I guess I'm out.
38:02 That's the tricky thing is, is yeah, if you're not in the industry yet, how do you get recruited
38:06 into it or how do you know someone? And what I've come to learn is it actually doesn't even matter.
38:11 So like, for instance, let's take, let's take your neighbor, right? Your neighbor is probably not a data
38:16 scientist. Maybe you're lucky and they are, and they can refer you to a company. But what's really cool is
38:20 I've learned that companies really come to trust their employees and their employees'
38:24 recommendations. And so even if your neighbor, let's say is a web developer, or maybe even less
38:30 technical, let's just say your recruiter is in finance, right? And if there's an opening,
38:35 like a data science opening at that company, a lot of the times they will actually take their
38:39 employee referrals much more seriously than any sort of cold application that they get.
38:44 And so a lot of the times I've had students who just know someone that works at the company,
38:49 they saw a job opening pop up. They're quickly, they message their friends. Hey,
38:53 do you know a recruiter or a hiring manager? I could talk more about this role. Could you do
38:57 an internal referral for me? And they were able to land jobs that they probably wouldn't have.
39:01 No, they definitely wouldn't have without that internal referral. So it is tricky. It's the old
39:06 cliche. It's not, it's not what you know, it's who you know.
39:08 I think there's still plenty of ways, COVID notwithstanding. I think that these days,
39:12 there's plenty of ways to get those connections, right? But maybe people don't know, like meetup.com
39:18 is really good. If you live in a non-tiny city, there's many, many things going on that around data
39:25 science, around Python, around other data engineering, whatever, right? You could go to those things.
39:31 They're typically even free. Often they are free with food. They even feed you, right? And make connections,
39:37 or regional conferences or national conferences, right? Like we probably, many people have heard
39:42 of PyCon, right? There's US PyCon, there's EuroPython, and then there's, but that's, those are the ones
39:48 that are often talked about, but there's 10, 20 little smaller regional ones in the US and many more that
39:54 I'm not aware of throughout the world. Probably one of those within driving distance, right? That you could
39:59 go to make connections and just also kind of take the temperature of actually what, what you see on the
40:05 internet versus what you see and actually talking to real people. So I'd also say, just get out there.
40:10 A hundred percent. Those places have the people who probably want to hire you because they're local,
40:17 right? Which is one thing that's, that's trouble on LinkedIn. I'm, I'm big on networking on LinkedIn,
40:21 but a lot of the times you're going to be networking with people who in all likelihood might never have a
40:26 role that's even open to you. But the people that you're like, for instance, we have,
40:30 I'm in Utah and we have Silicon Slopes that has like a tech meetup. We have a local Python
40:36 meetup chapter. We have the big data and developers conference that that's free every year with tons
40:41 of food. And the people who go there are people from companies around there that have the openings
40:47 that you're trying to find. And they want to hire people like you who are in the area. So at least
40:51 you can maybe come to the office once a week or maybe once a month or whatever. Right. And so really,
40:55 like you said, going to those meetups, it's tough because networking is always difficult,
41:00 either online or in person, but at least in those situations, you know, Hey, these are people that
41:05 are tied to real companies that exist around me that do make data higher. So I have a chance.
41:10 Definitely a much higher chance than just shooting out a resume. All right. Well, let's see.
41:13 We talked about job hunting already. What about like applications and resumes? What are your thoughts on
41:20 that? I think once again, with the applications, the more targeted that you can make it, the better,
41:26 right? So if you can really hone in on, I really want this job, I'm going to cold message five people
41:32 at this company and see if I can get that internal referral one way or another, make a real connection
41:37 with them. I think that's really key. And then with resumes, resumes are more of an art than they are a
41:42 science. I feel like they are so difficult to figure out. And these ATSs that are trying to match you
41:49 and see if you're a good fit. I've tried a lot of them and a lot of them suck. Whoever's the data
41:53 scientist behind those, we need to have a conversation with them because it's, it's a little tricky
41:57 sometimes. But one of the coolest concepts I've been introduced to recently, and I have a whole
42:02 episode on my podcast about it is A, B testing your resume. And basically the idea is a resume's job
42:10 is just to get you a screener interview or like a beginner interview, basically. Right. That's all an
42:15 interview. Like no one's seeing a resume and then hiring you. They're always going to interview.
42:19 So if you think about it, a resume's job, the only job it has is to convince someone to get on the phone
42:25 and talk to you. And it's just a piece of paper. And guess what? You can put whatever you want on
42:29 that piece of paper. Now I'm not saying to lie, but I'm just saying you could theoretically make a
42:35 perfect resume for whatever job you're trying to go for and send it out there and see what happens.
42:39 Right. But I'm not saying to do that. I'm not saying to lie. My point in saying this is that the resume
42:43 is just to get you the interview. And if you're not getting interviews, something's probably wrong
42:48 with your resume. And so, you know, tweak something, apply to 10 more jobs, see what happens. Tweak
42:54 something, apply 10 more jobs, see what happens. Until you finally have the right combination, skills
42:59 of experiences of different keywords. Because a lot of the time you're just trying to beat the ATS. And
43:04 that's the sad part about it is it's like, how do I prove to this random computer algorithm that they should
43:10 talk to me on the phone? That's a hard game to beat. And there's a whole bunch of advice
43:14 from all these different people. What I've come to learn is it's different for every company. It's
43:18 different for every person. You kind of kind of a numbers game till you get lucky and you figure it
43:22 out. That's good advice. I guess two thoughts. One is I know that speaking specifically to anyone,
43:28 one. But in general, women wait until they match all the requirements of a position where a guy's like,
43:34 I know three of those things. I'm taking a flyer. I'm sending it. I would just like to encourage the
43:41 women out there to just send it as well. I 100% agree with that. And I think if you reach 60% of
43:47 the requirements, I think you have a chance. Like it's a lot of the times those are wish lists and not
43:52 actual requirements. And depending on, are you local to the area? Do you have a domain experience
43:58 in this company? Like there's lots of other factors. What about contributing to open source
44:02 or having GitHub repos that can be like projects that you can show off or what's your advice there?
44:08 I'm a huge proponent of projects in the portfolio. I think if you don't have experience with something,
44:13 you create your own by building a project. And if you can do that with open source,
44:18 I think you should totally do that because I've benefited so much from open source. I have not
44:23 given back as much as I should to open source development and projects. I definitely should do that.
44:29 But if you can find a project that you're passionate about that you can help with, I think you should
44:33 totally do that. Even if it's not open source and you're just building a project to showcase your skills,
44:38 I'm all about that. I think you can do projects that are super fun, maybe that are good for your
44:43 community or good for your life. I'm a huge fan of personal projects. I've put a Fitbit on
44:48 my dog before and looked at her steps. I've found the healthiest meal at McDonald's. I've looked at,
44:54 like visualized my weight over time and tried to create like different, like forecasting models and
44:59 stuff like that. There's so much data in our lives that you can use to make really cool projects.
45:03 Oh, absolutely. You talked about, okay, you get your first job and that's where you kind of really
45:08 learn. But if you don't have your first job, you can effectively simulate that. Say, I would have gone
45:14 on to a job and been given a project to analyze something. I'm just interested in this thing. I've got
45:18 two hours a day until I get a job that I can be inspired about this and just get going on it. Maybe
45:24 create a website and publish your results and it can draw more people in to actually see that, right?
45:29 And start to appreciate it. They could even ask like, all right, who's behind this cool project?
45:34 Maybe I want them to come work for me. Little did they know you're doing all this work because you got
45:39 some spare time and you're trying to build up your experience and a self-guided study, right?
45:43 Yeah. If you can build a cool project and flip the job hunt where you're not applying for jobs, but jobs
45:49 start to apply for you, you're in such a good position and doing really cool projects can help you get there.
45:54 Now it's hard to do cool projects. It's hard to publish projects, which is one of the things
45:59 that people really struggle with. For all you Python listeners out there, let me just tell you,
46:04 Streamlit is absolutely amazing because it makes the deployment process so easy. It's free. It's a
46:12 little tricky to deploy at first, but compared to what you used to have to do it back in the day,
46:17 I'm saying back in the day, like four years ago, basically. But it was really hard to deploy
46:23 something where you could send someone a URL. Hey, check out my web application, machine learning
46:27 application. Streamlit is such a cool app that makes it so easy and so intuitive to make these
46:33 cool little apps that you could just put on your resume, put on your portfolio, send to recruiters. I'm
46:38 such a fan of the Streamlit app. I love it.
46:40 Yeah, it's super cool. There's a couple of those and Streamlit is definitely one of the really nice
46:44 ones there. There's also some hosting behind Streamlit as well these days, right? You don't even
46:51 have to set up a server or anything and just create it and put it up there.
46:54 That's what I'm saying. Back in the day, I used Dash a lot and I'm still a big fan of Dash. Dash
46:59 is more customizable than Streamlit and can do quite a bit more, but it's a lot more work to deploy it.
47:06 It's more like programming.
47:07 Yeah, it is more programming.
47:09 Programming the UI rather than just the behind the scenes.
47:12 Yeah.
47:12 You have to do both and you have to know a little bit about systems and data engineering and stuff
47:18 like that versus Streamlit kind of takes that, abstracts that away. But yeah, back in the day,
47:22 I used to make Dash web applications and deploy them on Heroku back when they had a free tier of
47:27 hosting and they've taken that away. So I don't even know what the go-to free hosting platform is
47:32 nowadays. I just, I moved most of my things to Streamlit and it's so nice.
47:35 Yeah. We got Shiny for Python now, which is also nice.
47:38 I haven't checked that out. How is it?
47:40 I haven't done too much with it either, but Joe and the team over there are doing pretty cool stuff,
47:45 like adding more dynamic interactive stuff to Jupyter, like running inside Jupyter and things. Yeah,
47:50 pretty cool.
47:51 I'll have to check it out.
47:52 I think they also do a bunch of hosting stuff over there as well, is why it came to mind.
47:56 What other advice you got for folks out there?
47:58 So AI is AI, not studying AI or learning to use AI, machine learning, but is there a benefit of trying
48:05 to use ChatGPT to help you get this job or is there a danger? I'm thinking, for example, have ChatGPT
48:12 write me an awesome resume and then the tools are like, well, we've detected this is AI generated and
48:18 it's out. You know what I mean? What do you see happening there?
48:21 A lot of people see AI as like an all or nothing tool as in it's either you, the human doing the
48:29 work or it's the AI doing the work. But whenever, I don't know about you, but whenever I'm using
48:33 ChatGPT for anything, it's very rare it's copy and paste for me or at least not iterative where I'm
48:40 doing multiple prompts, prompt after prompt after prompt, trying to tweak it exactly what I want.
48:45 And so the way I look at ChatGPT and other gen AI that will be coming out, that's only inevitable,
48:50 is instead of looking at does this replace me? Does this, like for instance, am I going to build
48:55 my whole resume using ChatGPT? Can ChatGPT build, you know, take a data scientist's job and build the
49:02 whole model for them? I like to see it more as like a hammer. It's like a tool for the data scientist or
49:07 a tool for the job searcher to use in conjunction with your screwdriver or anything else. It's like
49:13 something to be wielded by a human, not replaced for the human, if that makes sense.
49:18 You know, it's really good for stuff like, hey, I know a regular expression will do this.
49:22 Yeah.
49:23 The last time I studied, I completely forgot what this is about. And I know it's gnarly,
49:27 but if I just ask, here's an example, here's what I want. Boom. And traditionally what you would end up
49:32 doing is you'd be on Stack Overflow. Yeah.
49:35 You'd be all over the internet. You'd be trying to piece it together from external information anyway.
49:38 And so code is something that's a little bit more in the wheelhouse of the generative AI,
49:44 because it can't really make it up as much. I know it could like do something insecure and you didn't
49:50 know it was or whatever, but it's not like asking for legal advice where it makes up cases that didn't
49:55 exist. Like it gives you code. You put it in the runtime of the compiler and it runs or it doesn't.
50:00 And the output comes like you did.
50:01 Yeah. It works or not.
50:02 Yeah. So it's pretty, pretty effective for that. But yeah, for resumes, I would be more like,
50:08 let me ask it. What are the in demand things? And if I know these three skills, what other skills should
50:14 I know to get a, you could sort of use it in an explorative way to then come up with what you
50:20 might write for yourself, right? Something like this.
50:22 I find it really useful for brainstorming like action verbs on your resume bullets. Like I think
50:27 it's really good at that. What's 10 different ways to say lead. So I don't say lead five times on my
50:33 resume and I use some different action bullets. I think it's great at that. I personally, it's pretty
50:38 rare that I start any Python code from scratch nowadays. I'm either starting hopefully from a
50:44 template that I've already written, or I'm starting from a ChatGPT. Like this is what I kind of want
50:49 to accomplish, right? Like the outline for it. Like one of the things I hate doing is I make a lot of
50:55 streamlet apps. I probably make a streamlet app a month right now. And I hate starting from scratch
50:59 with streamlet. It's super easy to start from scratch, but I'll say, Hey, ChatGPT, I want to
51:03 build a streamlet app. This is like the component I want here. This is the component I want here.
51:06 This is the component I want here. And it's almost like a warmup for me as a programmer. And it will
51:12 create something that works. It's not what I want. And I spend the next five hours trying to make it
51:18 what I want, you know, without ChatGPT, but it kind of gives me a warm start to my programming process.
51:23 So I really like it. I think it's something that everyone should use. And I think if you're thinking
51:29 about getting into any sort of programming, you know, whether it's data science or web development,
51:34 I think you should be a little bit less worried about it taking your job and job security. I think
51:40 you should almost be more excited that, wow, the bar has never been lowered to break into tech.
51:45 Like this is a step up gift from the programming gods that I get to use to break into tech.
51:51 Another thing to keep in mind is I imagine a lot of people listening to this podcast are not just
51:56 starting a college program, right? They're coming from possibly other experiences, other specialties.
52:03 You know, what's really good for job security, knowing the intersection of two things, the
52:07 intersection of chemistry and programming, the intersection of geology and programming for Exxon,
52:13 potentially, right? Like those things take you from a pool of a thousand to a pool of tens,
52:19 tens, right? And so what's awesome about that is it means two things. You don't throw away,
52:24 if you got a degree in something else like biology or whatever, you don't throw away like,
52:28 well, that was wasted four years. That's out. And it slices the pool of people who could apply for
52:33 certain jobs way, way smaller, right? Sounds like you agree.
52:36 Oh, a thousand percent. I'll just tell a quick little anecdote. When I was at ExxonMobil,
52:41 there's a lot of things I did not like at ExxonMobil, but this is something I really liked. It's about
52:46 once a quarter, they would do a crowdsourced data science competition for the whole organization,
52:51 like around the entire world. And they would say, this is a business problem we're trying to solve,
52:56 you know, and at Exxon, we have data scientists all over the world and like all sorts of different
53:00 teams and things like that. So I like did not know all the data scientists at Exxon. And they'd say,
53:05 this is the problem we're facing. Here's the data go, right? And I loved participating in these. It was
53:10 like right up my, my wheelhouse of like, I really enjoy exploration and all this stuff.
53:15 At the time I was getting my master's degree, but I didn't have my master's degree.
53:18 And I was competing against, so I'm just a chemical engineering grad, right? And I'm competing against
53:24 people with PhDs in computer science and in data science and all these like people who have way
53:29 more experience than me. And I actually won a few of these competitions. Thank you. I appreciate it.
53:35 And it's not because I was a better programmer or a better data scientist. It's because I majored in
53:40 chemical engineering and I knew the business problem, the domain extremely well. And I kind
53:46 of knew the programming and the data science stuff, but the combination of them made me very valuable.
53:51 Like one of the best examples I have is we're looking at crude oil properties. And I remember
53:56 like there was a forum where you'd like ask your questions. And one of the, one of the data scientists
54:01 asked, Hey, is sulfur bad? There's lots of sulfur in this. Is it bad? And like to a chemical engineer,
54:06 that's like the most obvious thing. No, you, yes. Sulfur is very bad in crude oil. That's very,
54:11 no, no, that's like such a fundamental thing to me and to him or her. That was like groundbreaking.
54:17 And so, yeah, your domain can become your superpower in your career.
54:20 Yeah. And it makes it way harder for ChatGPT and other types of tools to just automate you out of a
54:26 job because you bring in all these skills together, which is awesome. But it also makes it easier for
54:31 you to get the job. It makes it easier for you to continue your momentum of whatever you've been up
54:35 to. It's just, it's good all around. Yeah. I think it's more fun too, because once again,
54:39 like when I was trying to decide if I should study computer science, I was like, man, I don't really want to
54:45 to build an Excel workbook for building an Excel workbook sake. That's still true for me today.
54:52 I don't want to do data science for data science sake. I only like machine learning or data science
54:57 when I'm doing it to solve a really fun problem I'm passionate about. That's where it's more fun.
55:01 So if you can be excited about the domain and excited about the algorithms, I think that's a
55:05 great place to be. Absolutely agree. All right. We're getting short on time,
55:09 but maybe tell us a bit about your data career jumpstart. You've referred to it a couple of times.
55:14 Yeah. I have a company called data career jumpstart. I just try to do a lot of education. So the
55:18 education happens on LinkedIn happens on YouTube. And I actually forgot to mention this at the
55:23 beginning, but I have my own podcast called the data career podcast, where I help people land their
55:28 first data job. We're about at a hundred episodes. So not quite the groundwork that you've put in.
55:33 That's still a ton. That's awesome.
55:35 Yeah, we're getting there. And then, yeah, I also have a bootcamp where I try to affordably help
55:40 people land their first data analyst position by teaching them the skills, the networking and the
55:46 project and portfolio building that they need to do something.
55:50 Like the long version of this show.
55:51 Yeah. Basically. Well, yeah. Just take what we talked about today, expand on it,
55:56 make it like 350 unique lessons. And that's exactly what it is.
56:00 Yeah. Very cool. All right. Well, we're about out of time. So maybe just every final call to action,
56:06 people maybe are inspired. I see Dave go out in the audience. That's an awesome talk. Very much so.
56:11 What's next. It's easy to be inspired, but you got to take action.
56:15 Yeah. I love that. I think it's always fun to listen to podcasts, but you probably benefit way
56:21 more from the action you take after a podcast. So for you guys who are maybe interested in a data
56:26 analytics or a data science career, explore that. If you're like, yes, I'm in, make a plan, make a
56:32 roadmap. If you need help, I have a webinar that will help you make a roadmap. What skills should you
56:36 learn? How should you be networking and stuff like that? But really probably if you're just getting
56:40 started trying to figure out what skills you should learn, like what are the top skills that you should
56:44 be learning and then learning those skills and then not only learning those skills, but take action and
56:48 learning and build some sort of a project that we talked about that you could put on a portfolio,
56:52 make a streamlet app or something like that. That's probably the best action you could possibly take.
56:57 If you need any ideas, advice, feel free to check out my website, datacareerjumpstar.com or the
57:02 podcast data career podcast. Hopefully there's a lots of free resources for you guys to check that out.
57:07 If you've never seen streamlet before, I have some YouTube videos about streamlet that you guys can check out, but I love it. Just take action somehow, do something.
57:13 That's one of the huge, huge differentiators is like, you might be inspired, but you just got
57:18 to start taking those steps and it becomes a snowball. So thanks for sharing all your experience and your
57:23 advice. Hopefully some people out there are taking action and yeah, I'll put everything we talked about
57:28 in the show notes, of course. So thanks for being here, Avery.
57:30 Yeah. Thank you. Thanks for having me. I appreciate it.
57:32 You bet. Bye all.
57:33 This has been another episode of Talk Python to Me.
57:37 Thank you to our sponsors. Be sure to check out what they're offering. It really helps support the
57:41 show. Take some stress out of your life. Get notified immediately about errors and performance
57:47 issues in your web or mobile applications with Sentry. Just visit talkpython.fm/sentry and get
57:54 started for free and be sure to use the promo code talkpython, all one word. This episode is sponsored
58:00 by Posit Connect from the makers of Shiny. Publish, share and deploy all of your data projects that
58:05 you're creating using Python. Streamlit, Dash, Shiny, Bokeh, FastAPI, Flask, Quarto, reports, dashboards and APIs.
58:14 Posit Connect supports all of them. Try Posit Connect for free by going to talkpython.fm/posit.
58:21 P-O-S-I-T.
58:22 Want to level up your Python? We have one of the largest catalogs of Python video courses over at Talk Python.
58:28 Our content ranges from true beginners to deeply advanced topics like memory and async. And best of all,
58:34 there's not a subscription in sight. Check it out for yourself at training.talkpython.fm.
58:38 Be sure to subscribe to the show, open your favorite podcast app, and search for Python.
58:43 We should be right at the top. You can also find the iTunes feed at /itunes, the Google Play feed at /play,
58:50 and the direct RSS feed at /rss on talkpython.fm. We're live streaming most of our recordings these
58:57 days. If you want to be part of the show and have your comments featured on the air, be sure to subscribe
59:01 to our YouTube channel at talkpython.fm/youtube. This is your host, Michael Kennedy. Thanks so much
59:08 for listening. I really appreciate it. Now get out there and write some Python code.
59:20 I'll see you next time. I'll see you next time.