Learn Python with Talk Python's 270 hours of courses

Python in Excel

Episode #446, published Fri, Jan 26, 2024, recorded Thu, Nov 16, 2023

Why is Python so popular? There is plenty of room for debate on this but one solid reason is it's easy to adopt, easy to use, and caters to people who are not quite developers/data scientists but need to do some computing. Do you know where there largest untapped set of that group hang out? Excel. That's why it's super exciting that Python is now going to be built directly into Excel. Just go into a cell and type =PY and you're off writing full Python 3 code that is backed by a lite Anaconda distribution of Python. And we have Dr. Sarah Kaiser here to give us the rundown on Python in Excel.

Episode Deep Dive

Guest

Dr. Sarah Kaiser is a seasoned Python developer and a Developer Advocate at Microsoft. She has an extensive background in data science and machine learning, shaped by her PhD work in quantum cryptography hardware and experimental physics. Her experience ranges from reverse-engineering laboratory instruments in quantum labs to building visualizations and data tooling at Wolfram (Mathematica). In her current role, she focuses on improving Python’s data science and machine learning ecosystem, bridging the scientific community with robust cloud and developer tools.


What to know if you're new to Python

  • A background in spreadsheets and some programming concepts is all you need to follow the conversation about Python in Excel.
  • If you are just getting started, understanding basic Python syntax (variables, loops, functions) will help.
  • For hands-on, practical learning aligned with this episode’s focus, check out Move from Excel to Python with Pandas.
  • Familiarize yourself with how Python handles tabular data (Pandas DataFrames) and basic plotting libraries like matplotlib or seaborn.

Key Topics & Takeaways

  1. Dr. Sarah Kaiser’s Physics and Quantum Background

    • Sarah began her career working with Mathematica and experimental setups for quantum cryptography. She later embraced Python to automate lab equipment and unify complex instrumentation protocols.
    • Building satellite-based quantum key exchange experiments and working on data-gathering systems taught her how critical Python is for experimentation, reproducibility, and large-scale data tasks.
  2. Python in Excel: Why It’s Exciting

    • Python’s popularity partly stems from its accessibility and community support. Integrating Python directly into Excel targets a massive audience of non-developer spreadsheet users who still need serious computing.
    • Writing =PY in a cell and executing Python code in Excel is a huge leap in bridging data analysis, automation, and collaboration without leaving the familiar spreadsheet environment.
  3. Collaboration and Enterprise Considerations

    • One major advantage of Python in Excel is the simplified workflow for sharing and revising calculations within a team. No more sending PDFs or CSVs back and forth—Excel’s shared docs now incorporate Python’s power directly.
    • IT administrators and data stakeholders benefit from robust security, since the Python environment is containerized in Azure and isolated from external dependencies.
  4. The Locked-Down Container and Security Model

    • Any Python code you run in Excel actually executes in a secure Azure container—no local Python installation is required, and there’s no network access from Python cells.
    • This approach streamlines maintenance (the environment is pinned and reproducible), enforces compliance in enterprise settings, and allays fears of rogue scripts or unverified packages.
  5. Supported Python Libraries and Data Flow

    • By default, you have core data science libraries: pandas, numpy, matplotlib, seaborn, scipy, scikit-learn, sympy, PyTorch, and more, thanks to a lightweight distribution maintained by Anaconda.
    • Data flows in two main ways: Excel cell ranges convert to Pandas DataFrames (via an xl interop object), and Python outputs can be rendered back into Excel as static tables or objects.
  6. Existing Tools for Python-to-Excel Integration

    • The conversation references popular Python libraries such as openpyxl and XlsxWriter for generating Excel files from Python. These remain valid when you need finer control or non-standard file automation outside the new feature.
    • Sarah highlights how “Power Query” in Excel handles external data imports, complementing Python’s lack of open network access within this containerized setup.
  7. Quantum Cryptography & Scientific Context

    • Sarah’s quantum cryptography stories—developing ground-to-satellite key exchanges and discovering vulnerabilities in real-world quantum devices—illustrate Python’s roots in serious research.
    • Many data scientists and researchers only discover the importance of version control and DevOps best practices mid-research. Python in Excel could lower the barrier for mixing coding and data analysis, possibly saving hours of ad-hoc workflow creation.
  8. Potential Gateway for Future Python Developers

    • Bringing Python to Excel could inspire many spreadsheet power users to deepen their coding skills. They see that Python provides easier syntax than complicated nested formulas, more robust plotting, and advanced ML libraries right at their fingertips.
    • As Sarah suggests, once people feel confident manipulating data in Python cells, they often ask, “What else can Python do?”—and find themselves exploring notebooks, broader Python libraries, or even specialized courses.
  9. Working Styles and Tips (Bonus Topic)

    • Sarah often starts by placing “init” or “import” code at the top-left cell of the first worksheet. Excel’s calculation order (left-to-right, top-to-bottom) ensures that cell runs first, mimicking a Jupyter “setup cell.”
    • Short, readable Python code blocks are crucial for maintainability within Excel. Long multi-line Python statements in a single cell can be cumbersome—modularity and clarity still matter even in spreadsheet land.

Powerful Quotes or Stories

  • “When I got to grad school, I realized none of our instruments talked together... I started learning Python to make our lab work.” — Sarah explaining how her physics research forced her into Python for instrument automation.
  • “We had a $5 million microscope get bricked because someone plugged a Windows XP Service Pack 0 machine into the internet.” — A cautionary tale on poor software practices in scientific labs.
  • “My first programming language was Mathematica, so I’ve always liked those notebook-like interfaces. Jupyter was a perfect fit.” — Sarah recalling her path into interactive scientific computing.

Overall Takeaway

Python in Excel marks a significant milestone for data analysts, scientists, and everyday spreadsheet enthusiasts. With a secure, container-based environment and access to essential data science libraries, this integration shortens the gap between ad hoc analysis and full-fledged programming. Dr. Sarah Kaiser’s experiences highlight how a tool like Excel—familiar to millions—can become a springboard for deeper involvement in Python, whether for advanced data science, machine learning, or everyday data wrangling. This episode inspires us to embrace Python’s versatility in new frontiers and reflects on how inclusive tooling can open doors for the next wave of data-driven innovators.

Links from the show

Sarah's website: sckaiser.com
Sarah on Mastodon: @crazy4pi314@mathstodon.xyz

Get started with Python in Excel: microsoft.com
Python in SQL Server: microsoft.com
8 of the Biggest Excel Mistakes of All Time: blog.hurree.co
Security and Python in Excel: microsoft.com
Episode transcripts: talkpython.fm

--- Stay in touch with us ---
Subscribe to Talk Python on YouTube: youtube.com
Talk Python on Bluesky: @talkpython.fm at bsky.app
Talk Python on Mastodon: talkpython
Michael on Bluesky: @mkennedy.codes at bsky.app
Michael on Mastodon: mkennedy

Talk Python's Mastodon Michael Kennedy's Mastodon