Land Your First Data Job
Episode Deep Dive
Guest Background: Avery Smith
Avery Smith is a data scientist, instructor, and consultant who took a unique path into data careers. He started out studying chemical engineering, discovered a passion for programming through MATLAB, and later transitioned into data science. Avery has worked at both small startups and large enterprises (including ExxonMobil), implementing machine learning solutions and translating data insights into real business impact. Now, he runs Data Career Jumpstart, mentoring aspiring data professionals on how to land their first data job.
1. Navigating the “Zero-to-One” Challenge
Breaking into a data role can feel like the toughest leap. Avery’s journey from lab technician to junior data scientist proves non-traditional routes truly can work. Once you land that first job, opportunities for growth multiply quickly, making future moves far easier.
- Many encounter the catch-22 of needing experience to get a job, yet needing a job for experience.
- Smaller companies or internal transfers (like Avery’s lab-tech-to-data-scientist story) can help you break in with fewer prerequisites.
- Once you have that first position on your résumé, each subsequent career move becomes much smoother.
2. The Power of Networking
Networking exposes you to hidden job openings and referral paths you’d never see by just applying online. Often, roles are filled before they’re publicly posted, emphasizing the value of genuine connections. Whether you meet people at local conferences or within your own organization, building relationships can significantly boost your job-hunt success.
- Many opportunities are filled via referrals before ever hitting public job boards.
- Local meetups and conferences are prime avenues for making personal connections.
- Even non-technical coworkers can help you find referrals within their companies.
3. The Role of Python
Python’s expansive ecosystem, user-friendly syntax, and vast community make it the go-to language in data science. From scikit-learn and PyCaret for machine learning, to Streamlit for quick app deployments, Python offers a streamlined way to build and share data projects. For those starting out, it’s one of the fastest paths to tangible, demonstrable success.
- Tools like Data Nerd show how in-demand Python is for data jobs.
- Its libraries and frameworks, including scikit-learn, PyCaret, and Streamlit, cover a broad spectrum of applications.
- Python’s lower barrier to entry helps new data professionals learn quickly and showcase value fast.
4. SQL as “Table Stakes”
Although you can do a lot with Python, SQL remains essential for working with large, structured data. Companies typically house critical information in relational databases, so querying proficiency can make or break your day-to-day success. When data extends beyond basic CSVs, SQL skills often become indispensable.
- Mastering fundamental commands (SELECT, JOIN, WHERE, GROUP BY, etc.) already makes you valuable.
- Many companies store massive amounts of data in relational databases, making SQL knowledge essential.
- Once data grows beyond basic CSVs, SQL queries are often indispensable for efficiency and performance.
5. The Power of Domain Knowledge
Technical prowess matters, but so does understanding your industry. Avery’s chemical engineering background helped him excel at ExxonMobil, because he grasped nuances—like sulfur levels in oil—that pure data scientists might overlook. Blending domain expertise with data skills can make you a vital asset in any organization.
- Avery excelled in internal data competitions at ExxonMobil because he knew how sulfur levels affected oil refining.
- Combining data expertise with fields like biology, finance, or engineering gives you a unique edge.
- Clear insight into the “why” behind data leads to more accurate models and solutions.
6. Creating a Portfolio with Accessible Tools
Portfolios let you prove your abilities instead of just claiming them. By building small, focused data projects or interactive dashboards, you offer tangible evidence of your skills. Tools like Streamlit or Plotly Dash make deployment more approachable, allowing you to share your work with potential employers.
- Tangible demos, such as interactive dashboards or web apps, let your skills speak for themselves.
- Streamlit and Plotly Dash make it simpler to convert notebooks into live apps.
- Deployment is far easier today—once-challenging steps can now be done quickly to showcase your work publicly.
7. AI as a Helper, Not a Replacement
Generative AI tools like ChatGPT can streamline coding tasks and brainstorming, but they don’t replace human judgment or creativity. Avery routinely uses AI-generated scaffolding for his Python or Streamlit projects, then refines the code manually. This lets him move faster without sacrificing the critical thinking that ensures robust solutions.
- Avery uses ChatGPT as a starting point for quick code scaffolding in Python or Streamlit projects.
- It’s an iterative process: AI suggestions are refined or corrected as you build.
- AI can also help brainstorm resume phrasing, highlight key skills, and speed up research.
8. A/B Testing Your Resume
A résumé’s main function is landing interviews. If it’s not working, iterate on it as you would a data experiment. Make small changes—like tweaking keywords or clarifying an accomplishment—then apply to a few more jobs to see if your response rate improves.
- Change one aspect, apply to 10 openings, track results, and iterate.
- Align keywords with role requirements to get past Applicant Tracking Systems (ATS).
- Showcase specific outcomes—how you improved processes, tackled real data sets, or deployed a working model.
Notable Links and Resources
Below are the tools, libraries, and websites explicitly mentioned or referenced:
- Avery Smith on LinkedIn
www.linkedin.com - Data Career Jumpstart
datacareerjumpstart.com - Data Nerd (Job Trends Site)
datanerd.tech - Write C# LINQ queries
learn.microsoft.com - Streamlit
streamlit.io - Plotly Dash
dash.plotly.com - scikit-learn (machine learning library)
- PyCaret (AutoML library)
- Shiny for Python (RStudio/Posit)
Overall Takeaway
Landing your first data job calls for a balanced blend of networking, skill-building, and showcasing real-world applications. Python and SQL remain central pillars, while domain expertise can elevate you far beyond what purely technical skills can achieve. Whether you’re spinning up a Streamlit demo or optimizing your résumé with an A/B testing mindset, tangible evidence of your capabilities is key. And remember—once you push through “zero to one,” every step afterward tends to get a little easier.
Links from the show
Data Career Jumpstart: www.datacareerjumpstart.com
Data Nerd Site: datanerd.tech
Write C# LINQ queries to query data: learn.microsoft.com
A faster way to build and share data apps: streamlit.io
Plotly Dash: dash.plotly.com
Michael's Keynote: State of Python in 2024: youtube.com
Watch this episode on YouTube: youtube.com
Episode transcripts: talkpython.fm
--- Stay in touch with us ---
Subscribe to Talk Python on YouTube: youtube.com
Talk Python on Bluesky: @talkpython.fm at bsky.app
Talk Python on Mastodon: talkpython
Michael on Bluesky: @mkennedy.codes at bsky.app
Michael on Mastodon: mkennedy