Stroll Down Startup Lane - 2026

Episode #551, published Thu, Jun 11, 2026, recorded Thu, Jun 11, 2026

0:00

01:48:54

Links Episode Deep Dive Transcript

Panelists

If you've ever been to PyCon, you know one of the best parts of the expo hall is Startup Row, a stretch of booths where early-stage companies built on Python show off what they're creating. But only attendees get to walk that lane, so let's bring it to everyone. In this episode, we stroll down Startup Row together. We kick things off with the organizers, Jason and Shay, who share the program's origin story going back to Paul Graham and the PSF, plus some surprising stats, including two unicorns among the alumni. Then we meet five startups: Tetrix, bringing AI to institutional investing in private markets. Arcjet, security that lives inside your app as an SDK. Phemeral.dev, serverless hosting built for Python web apps. CapiscIO, an identity and authority layer for AI agents. And Pixeltable, a multimodal database from Marcel Kornacker, co-creator of Apache Parquet. See if you can spot the theme running through them all. Let's go for a walk.

Episode Deep Dive

Guests Introduction and Background

This episode is a walking tour, so instead of one guest we meet eight people across six segments.

Jason D. Rowley is the co-organizer of Startup Row at PyCon US. He came to the program around 2015 as a financial journalist covering startups and venture capital, was talked into emceeing a pitch event in Chicago, and fell in love with the Python community. This is his 10th year organizing Startup Row, and he has been the primary organizer since 2019.
Shea Tate-Di Donna co-organizes Startup Row alongside Jason. Her background is in early-stage venture: she was part of the founding team at True Ventures, where she spent seven years across three funds. She later founded a full-stack Python ed tech SaaS company that was itself featured on Startup Row at PyCon 2015 in Montreal and was later acquired, so she has lived the program from both sides, first as a founder and now as an organizer and judge.
Naunidh Singh Bhalla is the co-founder and CTO of Tetrix (tetrix.co), an AI investment platform for institutional investors in private markets. A lifelong technology nerd, he now leads the engineering side of the company.
Grant Gittes is a founding engineer at Tetrix and a former investment banker turned software engineer, which gives him firsthand knowledge of the manual document drudgery the product eliminates.
David Mytton is the founder and CEO of Arcjet (arcjet.com), a runtime application security startup. He also runs Console (console.dev), a developer tools newsletter with around 30,000 subscribers that he has published for almost six years. Python was his first programming language, going back to the Python 2 era around 2008.
Chinmaya Joshi is the founder of Phemeral (phemeral.dev), a serverless hosting platform built specifically for Python web apps. This year was his first PyCon.
Beon de Nood is the founder and CEO of CapiscIO (capisc.io), an identity and authority layer for AI agents. He came from a C programming background and built CapiscIO after Google's A2A protocol convinced him the agentic web was missing core infrastructure.
Marcel Kornacker is the co-founder and CTO of Pixeltable (pixeltable.com), a multimodal database for AI applications. His resume is database royalty: database internals work in grad school, Google from 2003 to 2010 including the F1 hybrid transactional and analytic system, creator of Apache Impala at Cloudera, and co-creator of the Apache Parquet file format.

What to Know If You're New to Python

This episode is less about Python syntax and more about the ecosystem around it: conferences, web frameworks, packaging, and the AI infrastructure being built on top of the language. A little context on the following ideas will make the whole stroll click.

PyCon and Startup Row: PyCon US is the largest annual gathering of the Python community, and its expo hall includes Startup Row, a stretch of free booth space awarded to early-stage companies that build with Python. Knowing that this is a curated, judged program run with the Python Software Foundation explains why these five companies are worth your attention.
Python web frameworks and SDKs: Flask, Django, and FastAPI come up in nearly every segment as the standard ways to build web apps and APIs in Python. An SDK is simply a library you install (for example with pip) that wraps a company's service, and both Arcjet and CapiscIO deliver their security products this way rather than as external appliances.
Python's polyglot performance story: Python itself runs on CPython, which is written in C, and many fast tools layer Python over a compiled core written in Rust or Go. Several startups in this episode (Arcjet and CapiscIO especially) follow this exact pattern: a Go core for speed with a friendly Python SDK on top.
AI agents and their plumbing: An AI agent is a language model that can take actions through tools, and this episode is full of the emerging plumbing around them: MCP servers that expose tools to agents, installable "skills" that teach coding agents how to use a product, and llms.txt files that make docs agent-readable.
Hosting jargon, PaaS and scale-to-zero: A platform-as-a-service (PaaS) hosts your code without you managing servers, and "scale to zero" means your app costs nothing while idle but must "cold start" when traffic returns. That tradeoff is the entire technical story behind Phemeral.

Key Points and Takeaways

Startup Row is PyCon's on-ramp for Python-powered startups

The heart of this episode is the program itself: a row of expo hall booths PyCon awards to early-stage companies built on Python. Shea traced the origin back to 2011, when Paul Graham of Y Combinator collaborated with the Python Software Foundation to get more startups into PyCon, since many early-stage companies could not even afford a conference ticket, let alone a booth. Fifteen years later the program is still doing exactly that, giving founders free space to demo, recruit, and meet their users face to face. Shea's favorite scene is everyone gathered around a laptop at a booth, pointing at a demo and troubleshooting together, and one featured company told her how many product uploads they got during just their two days on the row. Since only attendees get to walk the lane, this episode brings the experience to everyone else.

The theme running through every booth: AI, with Python as its native language

Michael challenges listeners to spot the theme, and it is hard to miss: every one of the five startups is building with or for AI. Tetrix applies LLMs to private market documents, Arcjet detects prompt injection and helps enforce AI budget controls, Phemeral hosts the wave of Python web apps that AI-assisted builders are creating, CapiscIO gives AI agents identity and policy, and Pixeltable is a database designed for multimodal AI workloads. Shea put her finger on why this all lives at PyCon: Python has become the lingua franca of the large language model era. David Mytton saw the same thing from the vendor side, noting that as AI arrived in earnest around late 2024 and 2025, Python "started to shine again" because it was already native to the data science community, which pushed Arcjet to ship a Python SDK. If you want a single snapshot of where Python startups are in 2026, it is AI infrastructure all the way down.

Startup Row's track record includes two unicorns and a strong survival rate

Jason came armed with statistics. Roughly 170 to 175 companies have come through Startup Row since its inception. Looking at the seven batches from 2019 through 2025, about 60 companies, 32 are still active, 11 have been acquired, and 15 are no longer with us, which works out to roughly 12 to 15 percent acquired. Two alumni have achieved unicorn status, including Chainguard, the supply chain security company and returning PyCon sponsor that Jason pegged at around 3.5 to 3.8 billion dollars after its most recent funding round. As Michael noted, those numbers compare very favorably to the general population of startups. Shea added an explanation for why the row works: companies get to stand directly in front of their ideal customer profile, hearing feedback straight from developers for two days.

How to get on Startup Row: three rules and a December application

Jason laid out the qualification criteria. The one non-negotiable rule is that you must use Python somewhere in your stack, the more the better, with a slight implicit bias toward companies building open source projects. Companies should generally be younger than two and a half to three years, measured from incorporation or a major pivot, and teams should be roughly 25 people or smaller, with a preference for smaller. The program is deliberately not for 120-person Series B companies that would love free booth space; it is for small, early, high-growth ventures. Applications usually open around December since PyCon happens in May, a panel of judges with different backgrounds evaluates the short intake form, and decisions typically go out in January. Beon's story is encouraging here: Jason personally pinged him and suggested he apply, so the organizers are actively scouting Python startups, not just waiting for applications.

Tetrix brings AI to institutional investors in private markets

Naunidh broke down the mouthful of a pitch: Tetrix is an AI investment platform for institutional investors, meaning endowments, foundations, family offices, and pension funds, the pools of capital large enough to invest beyond public stocks and bonds into venture capital, private equity, private credit, and private infrastructure. The product attacks three pain points: collecting documents scattered across many sources, structuring and normalizing all that unstructured data, and generating analytics and insights that the market currently lacks. Grant, the ex-banker, described the before picture vividly: late nights manually typing values from PDF reports into Excel, exactly the drudgery LLMs are good at eliminating. The results Naunidh cited include compressing time-to-insight from 45 days to one day and saving customers thousands of manual hours while producing 10x deeper insights. The backend is fully Python: FastAPI for HTTP services, Pydantic for models and validation, and pandas and NumPy on the analytics side, plus OCR and data extraction tooling. The team came to PyCon in part to recruit, and they are hiring Python developers across the stack.

Keeping LLMs honest when the stakes are measured in billions

Michael pressed on the obvious worry: how do you keep probabilistic AI on the rails when clients are making enormous investment decisions? Naunidh's answer was a disciplined engineering playbook for deterministic outputs from a probabilistic system. First, Tetrix invested early in AI evaluation harnesses, with fully annotated datasets and a human in the loop, so experiments run fast and regressions surface immediately, because "you can't solve what you don't know." Second, the team encoded its financial domain knowledge into more than 250 financial rules that check every LLM output for whether it makes sense within a document and across time series. Third, feedback loops ensure every human-spotted error improves subsequent extractions. The result is an average out-of-the-box accuracy of 96 percent that keeps ticking upward, which Naunidh argues is the highest in private markets. It is one of the most concrete recipes for production LLM quality you will hear, and it applies far beyond finance.

tetrix.co

Arcjet puts security inside your app as an SDK, not outside it

David Mytton's founding insight is that developers have a reputation for not caring about security partly because security tools are not built for developers: they live outside the code, the editor, and the codebase. Arcjet flips that by shipping security as an SDK you drop into your application dependencies, with support for Python, JavaScript, TypeScript, and Go. Critically, it is framed around problems developers actually search for, like spam signups, abusive automation, and AI budget overruns, rather than products like a WAF or static analysis suite. The building blocks include bot detection, rate limiting, signup spam protection, PII detection, and prompt injection detection, and they compose into multi-rule security policies. Because the SDK runs inside your code, it has full context about the user, can log instead of block while you tune rules, and avoids false positives that external tools cannot reason about. David's analogy is DevOps: just as developers took ownership of operations, the right tools let developers treat security as just another feature to build.

Arcjet's architecture: WebAssembly sandboxes, gRPC, and GPUs on Modal

The implementation details are a tour of modern systems design. Local analysis like PII detection (emails, credit card numbers typed into a support form) runs in a WebAssembly sandbox that ships with the SDK and executes in process, in under a millisecond, so sensitive data never leaves your environment. Decisions that call Arcjet's edge network carry a 20 millisecond SLA for things like form submissions, with bot detection and rate limiting aiming for just a couple of milliseconds over a persistent HTTP/2 connection using gRPC, chosen because its binary wire protocol avoids JSON parsing overhead and gives protocol-level backwards compatibility. Prompt injection detection runs specialized models in parallel in Arcjet's cloud, comparing their results, served by a FastAPI ASGI app hosted on Modal for GPUs and autoscaling, abstracted behind the open inference protocol, adding roughly 200 milliseconds, which disappears inside a multi-second LLM call. The WebAssembly runtimes differ per ecosystem: Wasmtime for Python and wazero for Go, and that per-platform engineering is why new language support takes real work. Michael offered his own take that msgspec would be his choice for tight binary serialization in pure Python land, while conceding gRPC's multi-language story. And when Michael mused about more native WebAssembly execution in Python via C bindings, David's memorable answer was that he wants absolutely nothing to do with C in a security focused application.

Agent experience (AX) is becoming the new developer experience

A striking pattern across the row: companies are designing for AI coding agents as first-class users. Arcjet ships an installable skill so you can literally tell your coding agent "set up Arcjet" and it will examine your code, find the endpoints and tool calls that need protection, and wire everything up; afterwards the agent can follow logs, query data through Arcjet's MCP server, and propose rule changes to eliminate false positives. David noted that the homepage call to action is now "start with a prompt," and that good developer experience from 2023 (docs, examples, editor hints) turned out by accident to be exactly what agents need, including an llms.txt file for the docs. Pixeltable made the same bet: skills installable via npm, a uv-based starter template, and a data model whose type safety and compact code give agents guardrails. Marcel's framing is that when a complex pipeline is a few hundred lines instead of 10,000, there is far less room for the AI to drive the thing off a cliff. Michael connected this to a broader trend he is seeing with other dev tools companies investing in agent-ready docs, CLIs, and skills.

A supply chain security reality check for PyPI and npm

Michael and David took a sobering detour into package security. David observed that PyPI has not suffered the same level of attacks as npm, but argued that is largely a function of where attacker attention goes, and it will change. These registries were architected in an era of pure open source trust, essentially as volunteer-run code hosting, and even npm, run by GitHub under Microsoft, is struggling under attacks despite megacorp resources. Michael's analogy stings: pip install or npm install is inviting code you have never reviewed to run on your machine, the modern equivalent of early systems that asked for a username with no password. Both expect more vendoring in the future, with David predicting that coding agents will write simple dependencies for you, and Google's vendor-everything approach getting easier to imitate. The crucial exception is cryptography: some things you should never write yourself, and "we rolled our own crypto" remains a giant red flag. Arcjet's position in this world is protecting the production side, while pre-production tools handle code scanning and dependency management.

Phemeral is push-to-deploy serverless hosting built for Python

Chinmaya positioned Phemeral as a platform-as-a-service with a singular focus: Python web apps. You connect GitHub, push to a production branch, and merging a PR makes it live, with no Dockerfiles or containerization on your part; Phemeral's build pipeline inspects your codebase, figures out the tools and libraries in play, packages it, and deploys to their managed cloud. The technically interesting piece is a custom serverless orchestrator built on fast-starting VMs, so apps can scale to zero when idle without inflicting painful cold starts on the first visitor, while scaling instances up to match traffic. That solves the classic dilemma Michael described: keep a server warm and pay for idle, or scale to zero and make users wait. It hosts anything written in Python and exposed as an API, with Flask, Django, and FastAPI as the headline frameworks, though no GPU workloads for now. The roadmap centers on managed serverless Postgres deployed right next to your app, with managed Redis likely to follow, and the platform is in early access that anyone can sign up for today.

Phemeral's audience: small teams, agencies, and the new wave of AI-assisted builders

The Startup Row booth became a live customer research lab for Chinmaya. The strongest interest came from two groups that validated his ideal customer profile: small startup teams on a Python stack who would rather spend developer time on business logic than infrastructure, and agencies or consultants who juggle staging environments and handoffs for many clients at once. The agency case is especially neat: handing a finished project to a client becomes "push to this branch," and per-client staging environments stop multiplying into infrastructure work. Michael also raised the emerging cohort of vibe coders and data scientists, people who have built something real with AI assistance or notebooks but do not do Linux or DevOps, and Chinmaya agreed they are a massive new group of builders the platform serves well. The conversations that went nowhere were valuable too, helping define who is outside the target customer. It is a textbook example of why "talk to customers" is easier said than done, and why a booth full of developers for two days is so valuable to a young company.

phemeral.dev

CapiscIO is building the identity and authority layer for AI agents

Beon's light bulb moment came in March 2025 when Google released the first iteration of the A2A (agent-to-agent) protocol, alongside DeepMind papers on intelligent delegation: agents were about to communicate and self-orchestrate at scale, and the infrastructure to trust them did not exist. His analogy is the early 2000s internet, where the ideas arrived before the infrastructure, except this time builders know what is missing and are racing to close the gaps. CapiscIO grants every agent a cryptographic identity, with the private key stored on the agent or MCP server itself, registered with one line of code through the Python SDK. From that identity you attach policy: which tools an agent may invoke, which are blocked, and what group or organizational rules apply. Policies compile into OPA bundles cached directly on the agent or server, so authorization decisions resolve in under 10 milliseconds rather than adding the latency people fear from security layers. He also pointed at the missing trust primitives of the agentic web: humans check reviews before buying shoes, but no equivalent reputation mechanism exists yet for agents. The team is publishing its work as a public RFC stack, with an intent layer next, and is actively seeking design partners who are hitting these problems in production.

Air-gapped AI and the economics of compute came up on the lane

The CapiscIO segment wandered into one of the episode's most interesting hallway-track conversations. CapiscIO ships via Docker partly because enterprises ask to run it in their private cloud or fully air-gapped, and Beon met PyCon developers in highly sensitive environments, including one working on airplane voice and data recorders, where any AI must be sandboxed and air-gapped. Michael noted the NVIDIA and Anaconda booth combo at PyCon was demoing live AI on DGX Spark machines, and speculated that if subsidized AI pricing changes, buying a few thousand dollars of local hardware could become the economical choice. Both agreed today's API prices are heavily subsidized, and that the extreme load on compute is a forcing function for efficiency, with Michael citing the dramatic generation-over-generation efficiency gains in NVIDIA inference hardware as a reason the economics may normalize. The takeaway: do not assume the cloud-only AI era is permanent, and architectures that can run locally keep options open.

Pixeltable is a multimodal database where computed columns replace data plumbing

Marcel was clear that the name undersells it: Pixeltable is, at its core, an OLTP database in the spirit of Postgres, multi-user and fully transactional, but designed for multimodal AI applications. It adds column types for documents, videos, audio, images, and arrays, where media lives externally as file references (show up with a petabyte of video and it references rather than ingests). The signature idea is computed columns: declare that one column is the audio extracted from your video column (a UDF wrapping FFmpeg), and another is the transcript from running a transcription model on that audio, and Pixeltable builds the computational graph, plans execution, parallelizes it, and keeps everything transactional. Under the covers, structured data and the catalog live in Postgres with pgvector, with Pixeltable's own type system and transaction logic on top, plus integrations for Anthropic, OpenAI, and Pillow image transformations. Michael's reaction captures the appeal: it turns the database into a workflow engine that hides queues, async orchestration, and intermediate file management. As Marcel pointed out, for a podcaster who constantly extracts audio and transcripts from videos, this is very much on the nose. It is fully open source as a pip-installable package, with a cloud-hosted tables service (think RDS or Snowflake, but for multimodal AI) launching in the coming months.

From Parquet to Pixeltable: a database lineage thirty years deep

Marcel's backstory is its own takeaway for data folks. At Cloudera he started what became Apache Impala, a scale-out SQL engine, and co-created the Apache Parquet columnar file format because no good open source columnar format existed, essentially an open implementation of the ColumnIO format from Google's Dremel. Parquet went on to become an industry standard and now sits at the core of other standards like Apache Iceberg, which Marcel called gratifying to see. Before that he spent 2003 to 2010 at Google on scalable data infrastructure, including the F1 hybrid transactional and analytic processing system. The thread to Pixeltable started when he was an entrepreneur in residence at a venture firm in early 2022 and fell in with computer vision engineers, all working in Python, all drowning in data plumbing around training and dataset curation. Michael's side note is worth repeating for anyone still hoarding CSVs: if disk usage and slow parsing hurt, go look at Parquet.

The Go-core, Python-SDK pattern, and Python's comfort with polyglot stacks

Two of the five startups, Arcjet and CapiscIO, independently arrived at the same architecture: a performance-critical core written in Go with Python SDKs wrapping it. Beon confessed his team arrived at PyCon coached not to mention Go, only to find Python developers completely unbothered once they heard the reasoning, because the pattern is familiar. As Michael pointed out, Python itself runs on CPython, which is implemented in C, and much of modern Python performance comes from Rust layers underneath, via bindings like PyO3; his own websites run a Rust application server delegating to Flask. David chose Go partly because gRPC and Go come from Google and fit hand in glove. The lesson for builders: the Python community judges you on the experience you deliver in Python, including IDE feel and idiomatic APIs, not on the language of your core. Choose the right tool per layer and meet Python developers where they are.

Startup and career advice from the lane: problems first, and grow two T's

The wrap-up advice was some of the best content in the episode. Naunidh: do not be a solution in search of a problem; invest upfront in understanding customers, willingness to pay, and your unique point of view, and if you come to PyCon, be intentional about your objective and sign up for lightning talks early because they fill up fast. Grant, having watched a previous startup fail, emphasized solving concrete pain points people will pay for rather than nice-to-haves, and staying curious at the booth even when visitors have no idea what you do. Michael offered his long-standing career thesis: the secret sauce is intersecting a specialty, like investment banking, with programming skill, making you extremely powerful in a very small space. Naunidh updated the classic T-shaped developer model into two T's, one for technology depth and one for domain expertise. For listeners eyeing the 2027 batch, the practical path is clear: build with Python, stay under the age and size limits, and watch for applications around December.

us.pycon.org/2026/attend/startup-row

Interesting Quotes and Stories

Jason's accidental decade. Jason got involved in 2015 when the then-organizer needed someone to pinch hit as emcee for a pitch event in Chicago. He protested that he was not good on stage, conceded that he did know startups, and ten years later he is the program's primary organizer.

The flight recorder developer. Beon met a developer at PyCon who works on voice and data recorders for airplanes. Any AI in that world is introduced hesitantly and must be fully air-gapped and sandboxed, a reminder that the cloud-only mindset is not universal.

Don't mention Go. Beon's team arrived at PyCon coached to never admit their core was written in Go. When developers pressed and heard the reasoning, the response was a shrug: that makes sense, other products do the same.

"I always wondered, with my venture investing hat on, why aren't the VCs here? Because these are the people building incredible technologies." -- Shea Tate-Di Donna, on attending developer meetups

"It doesn't hurt that it's become the lingua franca for all of the large language models and the different evolution of the technology that we're seeing today." -- Shea Tate-Di Donna, on Python

"In finance, trust is the name of the currency. For our customers, it is essential that we are at 100% accuracy with every data point that we report because the decisions made are so consequential." -- Naunidh Singh Bhalla, Tetrix

"Don't be a solution in search of a problem. It should be the flipped way around." -- Naunidh Singh Bhalla, Tetrix

"There was this analogy of being a T-shaped developer... I think now it's almost like you have two T's. You have your technology T, but you also have your domain expertise T." -- Naunidh Singh Bhalla, Tetrix

"If you give developers the right tools and the right principles, they can just think of security as another feature that they have to think about and build." -- David Mytton, Arcjet

"You can write your own rate limiting library, you can do that pretty easily with Redis. But can you build your own bot detection and deal with the arms race of keeping up with automation?" -- David Mytton, Arcjet

"pip install something or npm install... it's just inviting other people's code that you don't know to run on your computer without checks, at least traditionally, which is insane." -- Michael Kennedy

"In the early 2000s, we invented this thing called the internet and we had all these great ideas, but there was a lot of infrastructure still missing to be able to realize that... With the whole agentic era, I think we've hit a reset button on a lot of that, except we're a lot more technologically mature." -- Beon de Nood, CapiscIO

"Everybody was chill. It was builder to builder conversations: what are you working on, how are you solving this problem? There were just a lot of real conversations. The vibe was great." -- Beon de Nood, on his first PyCon

"It gives you a lot of guardrails and it allows you to express these complex workflows with relatively little code. So there is less room for the AI to drive the thing off a cliff." -- Marcel Kornacker, Pixeltable

"I think the real secret sauce of being super successful is having some specialty like investment banking and some programming skill. And if you're quite good at both and you intersect them, you're all of a sudden extremely powerful in a very, very small space." -- Michael Kennedy

Key Definitions and Terms

Startup Row: A program at PyCon US, run in collaboration with the Python Software Foundation, that awards free expo hall booth space to early-stage startups that build with Python. See us.pycon.org/2026/attend/startup-row.
ICP (Ideal Customer Profile): A description of the customer who gets the most value from your product. Shea and Chinmaya both used the term to describe how Startup Row puts founders directly in front of theirs.
Private markets: Investments not traded on public exchanges, including venture capital, private equity, private credit, and private infrastructure. Tetrix serves the institutional investors (endowments, foundations, family offices, pension funds) who allocate to these asset classes.
Unicorn: A privately held startup valued at one billion dollars or more. Startup Row counts two among its alumni, including Chainguard.
Eval harness: Infrastructure for systematically testing AI model outputs against annotated, known-correct datasets, so teams can experiment quickly and catch regressions. Central to how Tetrix maintains accuracy.
WAF (Web Application Firewall): A traditional security appliance that filters traffic outside your application. David Mytton contrasts it with Arcjet's in-code SDK approach.
Prompt injection: An attack where malicious input manipulates an LLM into ignoring its instructions. Arcjet detects it by running specialized models in parallel and comparing results.
PII (Personally Identifiable Information): Data like email addresses and credit card numbers. Arcjet detects it locally, inside a WebAssembly sandbox, so it never leaves your environment.
WebAssembly (WASM): A portable binary format that runs sandboxed code at near-native speed inside another process. Arcjet ships its local analysis engine this way, using wasmtime.dev in Python.
gRPC: A high-performance RPC framework from Google with a binary wire protocol, used by Arcjet to keep request-path latency to a few milliseconds. See grpc.io.
PaaS (Platform as a Service): Cloud hosting where you push code and the platform handles servers, builds, and scaling. Phemeral is a PaaS focused on Python web apps.
Scale to zero / cold start: Serverless apps can shut down entirely when idle (costing nothing) but must start up when the next request arrives; the startup delay is the cold start. Phemeral attacks it with fast-starting VMs.
A2A (Agent2Agent) protocol: An open protocol, originally from Google, for AI agents to communicate with each other rather than just with tools or humans. Its release inspired CapiscIO. See a2a-protocol.org/latest.
OPA (Open Policy Agent): An open source policy engine; CapiscIO compiles agent permissions into OPA bundles cached locally for sub-10 millisecond decisions. See openpolicyagent.org.
MCP (Model Context Protocol): An open standard for connecting AI applications and agents to tools and data. Both Arcjet and CapiscIO integrate with it. See modelcontextprotocol.io.
llms.txt: A proposed convention for publishing LLM-friendly documentation at a well-known path so coding agents can learn your product. See llmstxt.org.
OLTP (Online Transaction Processing): Database workloads built around many small, concurrent, transactional reads and writes, like those behind a web app. Marcel describes Pixeltable as an OLTP system at its core.
Computed column: A Pixeltable column whose values are derived automatically from other columns, such as audio extracted from video, forming a transactional computational graph instead of hand-built pipelines.
Columnar format: A file layout that stores data by column rather than by row, enabling fast analytics and compression. Apache Parquet, co-created by Marcel Kornacker, is the canonical open source example.
Vendoring: Copying a dependency's source into your own codebase rather than installing it from a registry, reducing supply chain exposure. David and Michael expect AI coding agents to make this more common.
Air-gapped: Running systems with no connection to outside networks, required in highly sensitive environments like aviation. One reason CapiscIO ships via Docker.

Learning Resources

Want to go deeper on the ideas these founders are building with? Here are a few hand-picked courses from Talk Python Training that line up well with this episode.

Python for Entrepreneurs: The whole episode is founders building businesses on Python. This course walks you through creating and launching your own web-based business, from code to deployment to marketing, if Startup Row left you inspired to build.
Agentic AI Programming for Python: Skills, MCP servers, and coding agents came up at nearly every booth. Learn to work with agentic AI tools like Claude and Cursor as a force multiplier on real Python projects, the same workflow these startups are designing for.
Python Web Security: OWASP Top 10 with Agentic AI: Arcjet's segment makes the case that security belongs in developers' hands. This course puts it there, working through the OWASP Top 10 in Flask, Django, and FastAPI and building an AI security agent that audits your own apps.

Overall Takeaway

Walk the lane end to end and the message is unmistakable: the Python startup ecosystem in 2026 is an AI infrastructure ecosystem. Five very different companies, investing platforms, security SDKs, hosting clouds, agent identity layers, and multimodal databases, are all answering the same question from different angles: how do we make AI trustworthy, deployable, and productive in real production systems? And they are all answering it in Python, the language Shea called the lingua franca of the LLM era. Just as striking is how achievable this all looks up close. These are tiny teams, some still in early access, some at their very first PyCon, getting free booth space through a program that started with a conversation between Paul Graham and the PSF, and whose alumni now include two unicorns. The bar to entry is using Python, being small, and being early, and the organizers are literally pinging promising builders to ask them to apply. If you have domain expertise and Python skills, the two T's Naunidh described, the distance between you and a booth on next year's Startup Row is shorter than you think. Applications open around December. Maybe it is your company we stroll past next.

Episode Transcript

Collapse transcript

WebVTT format On GitHub

00:00 If you've ever been to PyCon, you know one of the best parts of the expo hall is Startup Row,

00:04 a stretch of booths where early-stage companies built on Python show off what they're creating,

00:08 but only attendees get to walk that lane. So let's bring it to everyone. In this episode,

00:13 we stroll down Startup Lane together. We kick things off with the organizers, Jason and Shay,

00:18 who share the program's origin story going back to Paul Graham and the PSF, plus some surprising

00:24 stats, including two unicorns among the alumni. Then we meet five startups, Tetrix, bringing AI

00:31 to institutional investing in private markets, Arcjet, security that lives inside your app as an

00:36 SDK, Femoral.dev, serverless hosting built for Python web apps, Capicio, an identity and authority

00:42 layer for AI agents, and Pixeltable, a multimodal database from Marcel Kornacker, co-creator of Apache

00:49 Parquet. See if you can spot the theme running through them all. Let's go for a walk. This is

00:54 Episode 551, recorded June 11th, 2026.

01:15 Welcome to Talk Python To Me, the number one Python podcast for developers and data scientists.

01:20 This is your host, Michael Kennedy.

01:22 I'm a PSF fellow who's been coding for over 25 years.

01:26 Let's connect on social media.

01:28 You'll find me and Talk Python on Mastodon, Bluesky, and X.

01:31 The social links are all in your show notes.

01:34 You can find over 10 years of past episodes at talkpython.fm.

01:37 And if you want to be part of the show, you can join our recording live streams.

01:41 That's right.

01:42 We live stream the raw uncut version of each episode on YouTube.

01:45 Just visit talkpython.fm/youtube to see the schedule of upcoming events.

01:50 Be sure to subscribe there and press the bell so you'll get notified anytime we're recording.

01:54 What if your AI agents worked like FastAPI microservices, typed, autonomous, and discovering each other at runtime?

02:01 That's the world AgentField is building.

02:03 Join them at talkpython.fm/agentfield.

02:07 To kick off this episode, we have the organizers of Startup Row at PyCon, Jason and Shay.

02:14 Welcome to Talk Python To Me.

02:15 Great to meet you.

02:16 Thanks for having us.

02:17 Great to be here.

02:18 I think this is a really cool idea. Long ago, way back, I think it was 2023, I did a stroll down startup lane. And the idea was, hey, this startup row is a really cool experience. But only attendees of the conference actually get to walk down the lane, walk down startup row, talk to each company, see what they're building, get that energy. So let's bring it to the podcast listeners, bring it to the world. So here we are. And you two are the organizers, right?

02:47 Mm-hmm. That's right.

02:48 Well, thank you very much on behalf of everyone.

02:50 Quick introduction.

02:52 Shay, welcome to the show.

02:53 Thank you.

02:54 Yeah, tell us about yourself.

02:55 So my background's in early stage venture and technology.

02:57 I was part of the founding team of a fund called True Ventures.

03:00 And then I was there for three funds, seven years.

03:03 And then after that, decided I wanted to start a company of my own.

03:07 So we were a full stack Python dev shop, and we were building an EdTech SaaS product.

03:12 And we had the opportunity to be featured on Startup Row at PyCon in Montreal in 2015.

03:18 So that was my introduction to the community.

03:20 And I had a fantastic experience.

03:22 And then we later got acquired.

03:24 And so in order to give back to the community, I started judging with Jason a couple years

03:29 later.

03:29 He asked me if I would help contribute to the process by being one of the judges that

03:33 selected the companies.

03:34 And then a couple of years ago, asked me if I would co-organize with him.

03:37 So that's how I got involved.

03:39 I've loved being part of the community.

03:41 I think it's fantastic and used to go to software development meetups in San Francisco to recruit technical talent.

03:47 And I always wondered, you know, with my venture investing hat on, why aren't the VCs here?

03:51 Because these are the people building incredible technologies.

03:53 And so really see the opportunity to feature these great companies that have become part of our corpus of startup for talent.

04:00 So you've lived the entire journey, which is awesome. Congratulations.

04:04 Thank you.

04:05 Jason, welcome.

04:06 Hey, good to be here.

04:08 So how did I get involved?

04:09 It was around 2015 when I was a financial journalist primarily covering startups and technology and venture capital stuff.

04:21 When at the time, the then organizer of Startup Row asked me to pinch hit for a friend of mine who was going to be co-hosting or emceeing a pitch event in Chicago.

04:38 And unfortunately, my friend wasn't able to participate.

04:43 And at the time, the organizer, Startup Row, asked me, oh, hey, Jason, you do some stuff with media.

04:49 You seem like you'd be good on stage.

04:52 I'm like, no, that's not true.

04:54 And he's just like, but you know some stuff about startups.

04:56 And I'm like, yeah, that is true.

04:57 And long story short is I got convinced to host this pitch event, and I really fell in love with the Python community.

05:07 This was during a time in my career where I was sort of transitioning into doing more like data journalism.

05:13 And I got really frustrated with like the limits of what I could do with, you know, Excel, you know.

05:20 And this was just at the very beginning of my journey, learning what little Python that I do know.

05:27 And so, you know, it's an amazing community.

05:30 and very long story short, this is my 10th year of organizing or being a co-organizer of Startup

05:37 Row. I sort of became the primary organizer in 2019, and I'm really excited to have Shea board.

05:45 I have some statistics to share, but I'm happy to share those later.

05:50 But yeah, that's a little bit about my background. Awesome. Very interesting.

05:53 So let's, I do want to hear your statistics. I think they're very interesting. And I have a

05:58 little hint of what we discussed. So I have a hint of where you're going. But before we get into

06:03 that, let's just give everyone a quick, Shay, maybe give everyone a quick overview. What is Startup Row?

06:09 Why are companies there? How do they get there? Yeah, it's a great backstory, actually. So in 2011,

06:14 Paul Graham of Y Combinator was collaborating with the Python Software Foundation. And with the

06:19 impetus of we want to see more startups at PyCon, we want to feature more of these amazing companies

06:24 that are starting new technologies and building things from scratch.

06:27 We want to give them an opportunity for a spotlight, but many of them are so early stage they can't afford the price of admission

06:32 or the conference ticket.

06:33 What can we do to help create an on-ramp for them and then to feature them?

06:37 And that's the origin of how it started.

06:39 And it's been amazing that we're 15 years in and the company is just fantastic.

06:46 As Jason alluded, we've got some great alumni companies as well.

06:51 All right. Well, let's hear the stats, Jason.

06:54 Okay, so since the program's inception, there's been approximately 170, 175 companies that have come through Startup Row.

07:03 The format of the program has changed a little bit over time.

07:07 It used to be the case that there would be eight companies featured, say, on Friday, and then a whole brand new batch of eight companies featured on Saturday, the two different days of the primary expo hall.

07:19 That was kind of a crazy and untenable situation for everybody in the long haul.

07:25 And so we have now eight companies per batch.

07:28 And so if you look at from 2019 through 2025, so that's counted six batches.

07:36 You know, we're looking at about eight companies.

07:39 Oh, hang on one second.

07:40 Was that six batches?

07:42 One moment.

07:42 I apologize.

07:43 Seven batches.

07:44 Forgive me.

07:45 Seven batches.

07:46 We have roughly 60 companies in that pool.

07:51 Of those, we have 32 of them are currently active.

07:56 11 of them have been acquired.

07:59 15 of them are unfortunately no longer with us.

08:02 And a couple of them, based on public activity, a little bit difficult to tell where they're at.

08:08 But the bottom line is that of this population of companies that have come through recent startup row batches,

08:15 You know, we're looking at approximately, you know, 12 to, you know, 15 percent of them having been acquired.

08:23 And then also in that group, we have a couple of companies that have achieved two companies that have achieved unicorn status.

08:30 So evaluation of a billion dollars or more.

08:33 That includes a returning PyCon sponsor, ChainGuard, which is, I think, valued at three and a half or three point eight billion dollars today after their most recent funding round.

08:47 And, yeah, it's it's a it's a pretty impressive population of companies that have come through Startup Row.

08:55 And it's been a real joy to see, you know, where everybody has gone after their, you know, after their time at PyCon.

09:03 That's incredible. I think those are super good numbers compared to just the general population of startups. Maybe we could kind of close it out with your thoughts, Shay, since you have some of the investing side. Those sound like good numbers to you as well?

09:15 That's fantastic. And one of the things I enjoy is that it's such a robust community that when a company comes into Startup Row, they're getting to meet with their ICP, their ideal customer profile. They're getting to hear straight from the mouths of the developers.

09:30 They're also fantastic people.

09:32 So it's really exciting to help connect them with the right opportunities and to see some of the larger sponsors at PyCon come over and interface with them and learn about what they're developing.

09:41 And one of my favorite times at Startup Row is to see everybody gathered around the computer screen watching the demo and pointing and troubleshooting.

09:49 And one of the companies that you've interviewed, they were saying how many uploads they've had of their product even just during the two days that they were featured.

09:56 And so you see the excitement straight from the developers.

10:00 It doesn't hurt that it's become the lingua franca for all of the large language models and the different evolution of the technology that we're seeing today.

10:08 So Jason, I think it's a fantastic opportunity.

10:10 And we love being part of helping the founders on their journey.

10:14 Well, let's see if we can get some extra work for you all next year.

10:17 Maybe you all could give a quick shout out just to how people apply and what the criteria are.

10:21 Oh, absolutely.

10:24 Everyone's take it.

10:25 Oh, sorry.

10:26 Oh, sure.

10:28 Sure.

10:29 Absolutely.

10:29 So there are three very basic rules for qualifying to be on Startup Row.

10:36 And two of them are somewhat fuzzy.

10:39 The one non-negotiable is obviously you got to use Python somewhere in your stack.

10:43 The more the better.

10:44 There is definitely a slight implicit bias in favor of companies that are building open source projects.

10:52 But that is by no means a requirement to be on Startup Row.

10:55 In addition to that, in general, companies are younger than sort of two and a half or

11:00 three years old.

11:01 And that can be measured from the, you know, either from the incorporation date or like

11:07 if the team has been working on a project and then did some major pivot like a couple

11:11 years ago, you know, that's not necessarily a count against their, you know, their eligibility.

11:17 It's just like, you know, we're trying not to feature companies that were founded 10

11:21 years ago and are, you know, more justifiably categorized as like robust small software

11:26 businesses.

11:27 Right.

11:27 And then the final one is, you know, in general, roughly 25 people on the team or smaller,

11:35 again, with a bias in favor of smaller teams, just in service of there are plenty.

11:40 I'm sure that there are plenty, you know, 120 person Series B startups that would love

11:45 free booth space on Startup Row at PyCon US.

11:48 But this really is for much smaller, earlier stage, high growth ventures.

11:52 That sounds great.

11:53 Final word, Shay, how do people apply?

11:56 When do applications open?

11:58 Things like that.

11:58 Yes, it moves a little bit depending upon our collaboration with the Python Software Foundation.

12:03 But usually applications open right around December, beginning of the year, because PyCon,

12:09 as you know, is held in May.

12:10 And so we like to have plenty of lead time for the companies to apply.

12:13 And also we're trying to coordinate better with the speaker talks.

12:16 They can do a submission to have a speaker talk as well, but usually around December, early December.

12:21 And Jason, we should probably put an email address up so people can start to reach out in advance if they'd like as well.

12:28 Yes, we should. I do believe we have startups at python.org or it's startupro at python.org.

12:36 But Michael, I'll get back to you and confirm.

12:39 Let's do this. Send it to me and I'll put it in the show notes so people have it right there.

12:44 That would be great.

12:45 Once we put the call out, it's a quick intake application, a few short questions, and then

12:50 we have a panel of judges with different backgrounds to evaluate.

12:53 And the news usually goes out in January and then tee everything up from there.

12:57 Fantastic.

12:58 All right.

12:59 Shay, Jason, thank you for being on the show.

13:01 And thanks for organizing Startup Row.

13:02 It's been great.

13:03 Thank you, Michael.

13:04 Great to meet you.

13:05 Yeah.

13:05 Yeah, you bet.

13:05 Bye-bye.

13:06 Cheers.

13:08 This portion of Talk Python To Me is brought to you by AgentField.

13:11 What happens when you give hundreds of AI agents a shared code base and let them write code,

13:17 review each other's work, and ship to production?

13:20 Well, that's exactly what the team behind AgentField AI built.

13:24 And the wild part, it's not some proprietary system locked behind a paywall.

13:28 It's an open source Python library.

13:30 Now, where most agent frameworks have you wiring up DAGs and workflows, AgentField lets you build AI agents the way you'd build FastAPI microservices.

13:40 Think typed Python functions that become autonomous services.

13:44 They discover each other at runtime, call each other like APIs, scale independently,

13:50 fail independently, and recover on their own.

13:52 And here's the thing.

13:53 You're not just orchestrating LLM calls.

13:56 You can orchestrate entire anonymous tools.

13:58 Spin up multiple Claude Code instances, Codex sessions, any coding harness you want,

14:04 all running as live nodes on the same architecture, collaborating and verifying each other's output.

14:10 that's how they build the factory and it's completely free and open source check it out

14:14 at talkpython.fm/agentfield that's talkpython.fm/agentfield the link is in your podcast

14:21 player show notes thank you to agentfield for supporting the show now let's meet our first

14:27 startup tetrix from nanid and grant nanid grant welcome to talk python and me amazing to have you

14:34 here thank you for having us michael yeah congratulations on being part of startup row

14:39 A few years ago, I did this episode where we did a stroll down startup lane, and I'm very excited to do it again this year.

14:45 And there's a theme. We'll see if people can detect the theme throughout this episode.

14:50 But it is also evident here, and that's great.

14:53 So let's just do a quick round of introductions before we dive into Tetrix, your company, and how you got on Startup Row and stuff.

14:59 Nanit, go first.

15:00 Sure, happy to. So I'm Nanit. I'm the co-founder and chief technology officer of Tetrix.

15:05 I've been a huge nerd about tech ever since I was a child and transformed that into my degree, then eventually a career.

15:12 And now I get to work with some of the smartest people building some of the coolest things in financial markets.

15:16 Very cool. It's such an incredible time to be a builder. So amazing. Grant. Hello.

15:21 Yeah. Hi. So I'm Grant. I'm the founding engineer at Tetrix.

15:26 Background in finance and technology. Former investment banker turned software engineer.

15:32 So really excited to, was really excited to join Tetrix and build what we have.

15:39 Well, I have a really good friend who has worked in basically as a lawyer and in banking and so on.

15:44 And he also does tech now.

15:45 We love to exchange stories.

15:47 It's a different kind of world, huh?

15:48 Very.

15:50 I think it's a fun one though.

15:52 And, you know, let's talk about your project that you've been working on, Tetrix.

15:55 So this is what you were presenting on Startup Row.

15:58 I'll tell it to the audience.

15:59 What do you got?

16:00 Absolutely.

16:01 So Tetrix is an AI investment platform for institutional investors investing in private

16:05 markets.

16:06 And I know that's a mouthful for people who aren't in this space.

16:09 So let me break it down a little bit.

16:11 So AI investment platform for institutional investors.

16:14 So our customers tend to be institutional investors like endowments, foundations, family

16:19 offices, pension funds.

16:21 Think of it as the pool of investors with the largest amounts of capital in the world.

16:24 And when you have that much capital, you're not just invested in public markets, which

16:28 is stuff like stocks, bonds, things that people are generally familiar with. But you also invest

16:33 in private markets, which is venture capital, private equity, private credit, private infrastructure,

16:37 and those asset classes. And what Tetrix does is it makes the management of your investments in

16:43 private markets much easier by solving three main pain points. Pain point one is the pain of

16:49 monitoring and collecting documents from multiple different sources on the internet. Pain point

16:53 number two is taking all of that unstructured data, structuring it and normalizing it,

16:58 And then solving pain point three, which is the lack of analytics and insights that currently

17:02 exist in the market by bridging that with our data sets and our extractions to make sure

17:07 that institutional investors can make more confident investment decisions.

17:11 And the platform has really taken off since we launched it.

17:14 And the ROI is very evident, whether it's compression of time from 45 days to one day

17:18 to insights, whether it is the thousands of hours manually being saved or the 10x deeper

17:23 insights that customers are generating from the platform.

17:25 Okay.

17:26 Well, that sounds super interesting.

17:27 I did need to, when I first spoke to you, get a little bit of education on what limited partners exactly means in investing.

17:33 If I think about where AI has leverage, people talk about, oh, the AI bubble.

17:38 Maybe like my male client doesn't necessarily need AI and that kind of just gets in the way.

17:43 But there are areas where it applies so well.

17:46 I think coding and data science is probably number one.

17:49 But I've always felt that investing has this special lever for AI that you could really get a lot out of.

17:56 And I haven't done nothing with it.

17:57 I don't know why.

17:58 I feel like, I guess because I don't have the experience, but Grant, you've been in a banker.

18:02 So you know the actual day-to-day problems, right?

18:05 Yeah, exactly.

18:06 I mean, one of the, like, I mean, as a banker, like doing what we do, like docking processing,

18:11 like that was all manual effort of like literally typing in like values into Excel from like PDF reports, those types of things.

18:19 And being able to kind of like streamline that really manual process is like really fascinating.

18:25 Because there were so many hours spent late at night doing this very annoying manual process.

18:30 And now that you can kind of do that unstructured to structured transition with LLMs is really exciting.

18:36 It's super exciting.

18:38 And people always say, oh, man, I always wanted to have some kind of thing help me out.

18:45 But I wanted to help me with the drudgery.

18:46 I didn't want it to write music and write code.

18:48 I wanted to do the boring stuff and I wanted to do the exciting stuff.

18:51 But it sounds like you're kind of leveraging it for the boring stuff here.

18:54 Exactly.

18:55 Exactly. And even not just for customer specific use cases, when you look and peer behind the

19:00 curtain at Tetrix internally, whether we're on the engineering side, the business side,

19:04 everyone is leveraging what's being shipped out, whether it's connecting to MCP servers to make

19:08 their workflows more efficient, whether it is using cloud routines to automate some of the tasks.

19:13 We are always thinking about how can we execute faster internally and how can we keep delivering

19:17 value to our customers using the AI as a tool rather than just kind of AI wash everything and

19:22 say we are an AI native company just for the sake of it.

19:25 I mean, that's where the VC is, but just kidding.

19:28 Sort of, sort of.

19:29 But how do you guys go about making sure that the AI stays on track?

19:34 You know, one, I actually, I did use AI for this investment thing that I'm doing.

19:40 And it's like a retirement planner, which is fine, you know, but it's not invested in like

19:44 the same sense that you all are.

19:45 And the AI like gives you recommendations on how your retirement plan is going.

19:49 And I get the sense, I don't know for sure.

19:51 I haven't used enough.

19:52 I get the sense they're using a cheap model to get fast results, which makes me sad.

19:57 Like if they were using Opus or GPT-5 Pro, you could get such good answers.

20:03 But it's really, really quick and responding just makes me real nervous that the answers are super shallow.

20:07 So how do you, I mean, it's important the answers and the details you give to people.

20:11 They're making really large decisions.

20:13 It's not like, oh, I'll buy $1,000 of this stock.

20:16 You know, I'll buy thousands of shares of this stock type of thing, given your clientele.

20:21 So how do you make sure the AI is on rails?

20:24 No, that's an excellent question.

20:26 And to your point in finance, like trust is the name of the currency.

20:29 And for our customers, it is essential that we are at 100% accuracy with every data point

20:35 that we report because the decisions made are so consequential.

20:38 To answer your question, there's quite a bit of discipline that we have within the Tetrix

20:42 engineering processes to make sure that we are able to get more deterministic nature

20:46 of outputs from something that's inherently probabilistic.

20:49 And part of those, without disclosing too much of the secret sauce, but some top things are A,

20:54 we invested a lot upfront in building our AI and evaluation harnesses. The good news is that the

21:00 fact that we need 100% data accuracy means that we have very, very good data sets that are already

21:06 annotated, have a human in the loop in terms of making sure it's 100%. And when we run experiments

21:11 against those harnesses and those evals, it allows us to move really quickly and also figure out

21:16 very quickly if something is breaking or not. So that was number one, because you can't solve what

21:20 you don't know. Number two was most of our team has a lot of financial background, whether it is

21:25 Grant, obviously, as an ex-investment banker, my co-founder or others. And what we have done is take

21:30 that institutional knowledge of finance and encode it into a set of over 250 financial rules that are

21:36 checking the outputs generated by the LLMs to make sure that things actually make sense from a

21:42 financial perspective, whether it's within a document or when you look at things from a time

21:45 series perspective. And number three is we have also built a lot of feedback loops so that whenever

21:50 there's a human review or there's an erroneous data point spotted, we actually learn from that

21:54 behavior and make sure that the subsequent extractions keep getting better and better.

21:58 So with some of these things, we have been able to really get very high accuracy. And in private

22:03 markets, I would argue we have the highest accuracy out of the box where our average accuracy is 96%.

22:09 And that number keeps ticking upwards as we continue investing in this part of the platform.

22:13 cool. So why Startup Row? How'd you end up there? Yeah, so we applied for Startup Row to, so we

22:20 wanted to be more integrated into the Python community and also to expand our team. So we

22:28 applied given that PyCon's the largest event conference for Python developers and our whole

22:35 backend is built in Python. We wanted to kind of go there for a recruiting perspective, get the best

22:39 and brightest talent and also see what other people, other companies and, you know, individuals

22:44 are working on, whether it be open source projects or, you know, companies that we use

22:49 on a day to day basis.

22:51 Yeah.

22:51 So basically, and also to tell people about what we're doing and kind of get the word

22:57 out about Tetrix.

22:59 Yeah, cool.

23:00 So you're looking for possibly hiring some Python folks?

23:03 Yep.

23:04 Across the stack, we're looking to definitely scale up our engineering team.

23:09 so yeah, I mean, front end, full stack back end, but you know, from a, an AML,

23:14 but from a coding perspective, like definitely at least a lot of their backend data pipeline,

23:19 that kind of thing, fully in Python.

23:21 Well, I was talking to someone else.

23:22 I won't call them out explicitly just now, but they had a really interesting insight

23:26 saying that, you know, AI is going to change our jobs, obviously.

23:30 I mean, that's not a huge revelation, but, but that it's probably going to shrink really

23:35 large companies, because there's kind of a lot of support that maybe the AIs can do, but it's also

23:40 going to expand out the existence of small companies, what small companies can do and so on.

23:46 And it kind of sounds like you're a bit of an embodiment of that. Yeah, I think that's a fair

23:51 analysis. I mean, we are a small but mighty team. We're growing, adding headcount as needed,

23:56 but AI is obviously supercharging productivity internally as well. So what a person can deliver

24:02 in a unit time has for sure increased and we want to keep leveraging those things

24:06 to make sure that's true.

24:07 But at the same time, there's so much demand for the product at the moment

24:11 that capacity still is the constraint and hence why we're very excited to bring on more folks

24:16 who have Python expertise into our community and into our team and work with them.

24:20 That sounds really good.

24:21 That sounds really positive.

24:23 Like things are going well for you.

24:24 I talked to Jason, the organizer of Startup Row and pointed out how many of the companies

24:28 are either still going, got acquired or went public or whatever.

24:32 So there's a good track record there.

24:34 So how did your experience at Startup Row go?

24:37 Was it worth going?

24:38 Did you meet people who might be investors or partners?

24:42 Did you find people who might build some of these roles?

24:44 Was it worth going?

24:45 Yeah, definitely.

24:46 I mean, it was definitely a really great experience.

24:48 I mean, we got to basically talk with a lot of people from different backgrounds.

24:53 Yeah, I found, I mean, definitely a lot of promising candidates.

24:56 And also it's very just cool to talk about our company to people who are like kind of like very large established financial services companies and tell them about what we're doing.

25:09 They kind of get the business problem.

25:11 But it's also really cool to see that we're in kind of the same like we're in Startup Row, but like we're talking to the same people and kind of being involved in the Python community as some of these really large established players.

25:24 Yeah, when you all describe what you're doing, I'm like, there's something with a long sales cycle. People don't go to your site and just put in the credit card.

25:31 Very true.

25:33 The laugh of, you have no idea how hard this is to sell to enterprise, right?

25:39 Just the timeframe from first contact to like, yeah, we'll do it is I'm sure really, really large. I've been on that site a little bit and it's kind of mind blowing.

25:48 All right, a couple of things to wrap things up here.

25:50 Give us a look inside, like a tech architectural look of what you're all building.

25:55 You know, it's got some Python stuff going on, but what kind of tech is at play

26:00 without giving away secret sauce?

26:01 Yeah, so generally like our whole backend is Python.

26:05 So we use FastAPI for all our, you know, HTTP services, use Pydantic for models and data validation,

26:14 use a lot of like, you know, when we use AI, we're usually calling or doing things from a Python client for all of our data

26:23 pipelines. Yeah. And I mean, we obviously use like other technologies, but, you know, I mean,

26:29 be it like, you know, pandas or, you know, NumPy for like kind of more of the analytic side and then

26:34 different kinds of packages for doing OCR and data extraction and all that stuff is basically Python.

26:42 Fun. It's a cool tech stack. It's really neat to work with these things these days. It's a lot of nice tools. So what's on the roadmap? You are host startup row. You've had a lot of conversations. Where are you leading? Maybe did also did some of these conversations influence you a bit on what you should focus on?

26:58 That's a good question. So, well, we have some exciting fundraising announcements coming soon. And as part of that, we obviously are going to spend a lot of time doubling down on the product and delivering the best customer experience possible. So in terms of some things that we're excited about, and I'll put the business hat on and then the tech nerd hat on. But on the business hat side, we're working on some very interesting new features. So if you look at Tetrix, you can kind of think of it as a three layered cake. There's the data capability layer, there is the workflows layer, and then there's the inside.

27:28 layer that we deliver to customers. The data layer, we've been spending a lot of time since

27:33 the start of the company. And I think we're in a really, really good position there.

27:37 And we really want to start double clicking on the workflow and the insights there. So a lot of our

27:41 time is being spent for use cases specific to our customer on those two layers, whether it is

27:46 being able to diligence, net new opportunities faster, whether it's being able to get flexible

27:51 outputs in terms of investor memos and some of those things. So we'll be doubling down on those

27:55 things. On the tech side, where we're spending a lot of time on the roadmap is scaling and

28:00 optimization. As we grow as a company, the amount of data points we are seeing is way larger than

28:05 we have in the past. And that obviously puts pressure on the systems itself, but is a very

28:10 fun and interesting technical problem of how do you end up scaling that data. AI is moving super

28:14 quickly. So all of us have the pulse on what's going on. And we're always thinking of ways to

28:18 kind of leverage that if it makes sense within our product. So those are some of the many,

28:23 many, many highlights that we have on the roadmap right now, but hopefully gives a good teaser of

28:27 how we think through the product and how we think throughout on the tech side.

28:30 Cool. That sounds very great. Very nice plans there. So final thought, as we wrap up our time

28:37 together, what advice would you have for people who maybe are thinking to start a company or have

28:42 just started or maybe pre-revenue or something along those lines who want to be on startup row?

28:48 I guess we each can take this one. I'm happy to start us off. I think general advice for a startup,

28:53 don't be a solution in search of a problem.

28:55 It should be the flipped way around.

28:56 So put in all the effort upfront in terms of really understanding your customers,

29:00 the willingness to pay, what value are you providing relative to what's out there?

29:04 What's your unique point of view?

29:06 Those things.

29:06 So that's the general word of advice there.

29:08 And I think for startup role specifically, the advice is be very intentional

29:12 of why you're showing up to PyCon.

29:14 Everyone's time is precious, including yours.

29:16 So make sure there is a true objective there.

29:18 And one thing I didn't realize actually going in and wish I had known earlier,

29:22 Lightning talks fill up really fast.

29:24 So more tactically, make sure you sign up for lightning talks ahead of time.

29:29 So that's my piece of advice generally and with regards to PyCon.

29:32 Excellent, Grant.

29:33 Yeah, so for me, I was at a startup prior to Tetrix, which did not succeed.

29:39 And being part of Tetrix, which is doing really well.

29:42 I think the key thing is having really concrete problems that you can solve for your customers

29:48 and being able to, you know, be able to have a product that really alleviates

29:54 a pain point that people are willing to pay money for versus like, this isn't like nice data to have,

29:58 nice thing to do, but versus something that's like, wow, this really like revolutionizes the way I do my job

30:05 or anything like that.

30:06 And the fact that we've been able to, you know, you know, really scale up and gain like

30:11 these really high profile customers and we're kind of like transforming their whole experience

30:16 just means that we're solving a really important problem.

30:19 And that's kind of like, you know, what I've seen at, you know, at Tetrix versus my prior company.

30:24 And I'd say for startups coming to Startup Row, I mean, I think just be really like curious

30:31 and open to different people and like, you know, see what, really be interested in what people are working on.

30:36 You know, maybe someone comes, talks to you, they have no idea what you're doing,

30:40 but you can at least like go and explain like how your company's working,

30:43 how you're using Python, kind of connect from an engineering perspective.

30:46 because a lot of those like engineering problems are kind of the same.

30:49 I mean, it's all, you know, data and users and, you know, dealing with those types of problems.

30:54 So just trying to educate, but also take in as much as you can.

30:59 Cool.

30:59 Well, let me just riff, just a moment before we close it out here.

31:03 I've always said that it's, I know a lot of people are getting into tech

31:07 or getting into programming.

31:08 They're like, I want to be a software developer and I'm choosing APIs or I'm choosing, you know, data science or something.

31:15 And that's great.

31:16 But I think the real secret sauce of being super successful is having some specialty like investment banking and some programming skill.

31:25 And if you're quite good at both and you intersect them, you're all of a sudden extremely powerful in a very, very small space, right?

31:32 Like taking your knowledge of banking and turning it into this product with programming and AI skills puts it into a really rare space, which I think that's a really good way to lever up your career.

31:44 as people are thinking like, oh, it's hard to get a job at a big tech company.

31:46 It's like, well, maybe you don't check just chase just straight down the line of I'm just

31:50 going to be a web developer, but some kind of intersection with your specialty or your

31:54 passion.

31:54 Yeah, exactly.

31:55 And one of the great things about startup is like you're really close to the business

31:58 problem and you see kind of like software and it's more targeted how you're going to

32:03 solve like actual problems that your users are going to are facing and you're getting

32:07 the feedback live and it's really rewarding environment.

32:11 And also you get to learn a ton because you're basically working across the stack on every

32:17 problem that you're getting.

32:18 And it's really an awesome way to learn programming.

32:21 Yeah, that's awesome.

32:22 I love small companies.

32:23 Now, Ned, you got the final comment for the show.

32:26 Let's wrap up the segment.

32:27 It's just the thing that we were saying reminded me of back in the day, there was this analogy

32:31 of being a T-shaped developer where you cut across the stack and are deep in one thing

32:35 technology-wise.

32:36 I think now it's almost like you have two T's to what you say.

32:38 You have like your technology T, but you also have your domain expertise T.

32:42 And that's kind of the next metamorphosis, I guess, of how the engineering role is going to play out.

32:48 But my final word is just thank you again to the Python community.

32:51 Thank you to you, Michael, for having us.

32:53 And we're really excited as a company at Tetrix here to be deepening our partnerships.

32:58 And if anyone is looking for roles or wants to join an exciting startup, Tetrix is always on the lookout for the best and the brightest.

33:04 And we'd love to continue the conversation as we continue to scale.

33:06 Awesome. Well, congratulations.

33:08 you guys for your success so far and wishing you more in the future talk to you later thank you so

33:13 much michael talk to you thanks michael next up we have david from arcjet david welcome to talk

33:22 python to me awesome to have you here for this startup row segment yeah thanks for having me

33:27 it's gonna be fun it was nice to meet you at long beach i enjoyed the conference a lot how was it for

33:31 you yeah it was good and the python community is great so always good to be there in person it's

33:36 not my first one. So it's always good. It's a little rejuvenating, at least for me, to

33:42 actually connect with the humans on the other side of the docs and projects and so on.

33:46 Yeah, absolutely. Yeah, well, let's just start with who you are. Do a quick introduction for

33:51 the listeners. Yeah, I'm David Mytton. I'm the founder and CEO of Arcjet, which is a runtime

33:57 application security startup. I also run a DevTools newsletter called Console.dev, where I review

34:05 interesting developer tools.

34:07 35,000 subscribers have been doing that for almost six years now.

34:10 But the day job is helping developers secure their applications.

34:14 Very cool.

34:15 So I'll be sure to put your newsletter in the show notes.

34:18 People can check it out.

34:19 But security, you sure it's not just a fad?

34:23 Yeah, everything is secure.

34:24 No problems.

34:25 We've solved security.

34:26 You can all go home.

34:27 Yeah, exactly.

34:28 We have so many tools and we have so much more awareness, but we also have a lot more

34:32 clever people trying to do bad things, right?

34:35 Like just look at all the supply chain issues we're running into.

34:38 Right. Supply chain is a real problem.

34:41 And it's been ignored for a very long time.

34:43 There's some really interesting tools that are coming up around that.

34:47 Arcjet sits in your application once you deploy into production.

34:50 So we protect against attacks coming in, automation.

34:54 We help you enforce budget controls, deal with prompt injection.

34:58 But the supply chain is just as important.

35:02 I kind of split it pre-prod and prod.

35:04 And we're probably on to year end production.

35:06 And then pre-prod is all the code scanning, dependency management, all those kinds of things.

35:10 Yeah.

35:11 Some of the worst attacks have actually been through this whole supply chain side.

35:15 You probably remember long ago when the internet first came into existence,

35:21 some of these systems didn't have passwords.

35:23 It's just, what's your username?

35:24 Oh, okay.

35:25 Hi, welcome back, Dr. Falcon or whatever.

35:28 And it seems so naive.

35:29 Like, how could that be?

35:30 But I think, you know, pip install something or npm install, it's almost, it's just inviting

35:36 other people's code that you don't know to run on your computer without checks, at least

35:40 traditionally, which is insane.

35:41 So I don't want to go down that rat hole, but it's just, you know, the thought of that

35:45 analogy there is like a little stark, I think.

35:48 But let's talk about Arcjet.

35:49 So it is security that runs inside your code.

35:53 Tell us, I know you gave a little bit of a hint there, but tell us a little bit more about

35:56 it.

35:57 developers have this really bad reputation for not caring about security and that was front of mind

36:04 when i started the company because like you said there are so many security tools it's just they're

36:08 not built for developers they're outside of your code they're outside of your editor they don't have

36:14 any connection to what's going on inside your code base and so developers would often just delegate

36:20 that to a third-party service or a different team and just like with devops where developers take

36:27 responsibility for the applications that they're building, I think we can apply similar principles

36:33 to security. Whereas if you give developers the right tools and the right principles,

36:37 they can just think of security as another feature that they have to think about and build.

36:41 And so that's what Arcjet is. It's an SDK. You drop it into your application dependencies.

36:46 We support Python, JavaScript, TypeScript, and Go. And then it gives you these building blocks to

36:53 solve problems and the problem framing is critical because developers when they have to think about

36:59 security often do it once as a problem they're not buying a WAF they're not trying to buy code

37:05 scanning static analysis tools they don't search for those things they search for i'm getting spam

37:11 signups or someone's abusing my application and i need to apply budget controls these are specific

37:18 developer-focused problems.

37:20 And that's where we try and help developers.

37:23 We give them building blocks for bot detection, for sign-up spam protection, for prompt injection.

37:29 And more recently, we're creating these in a way that means you can just get your coding agent

37:34 to solve it for you.

37:35 That's what you've got on the screen there.

37:37 And you just install our skill, and then you ask your coding agent, set up Arcjet, and it will go through,

37:43 examine your code, figure out what endpoints, what routes, what tool calls need protection, and then install Arcjet and help you solve the

37:51 security problem. That is really neat. I love how it knows, basically bootstraps itself onto the

37:57 system by teaching the AI agents how to do things. I'm really starting to notice people deeply

38:03 thinking about how to support AI agents that work alongside developers for their projects. I just

38:10 had the great tables guys from Posit, and they've got a whole section on agents and skills and

38:16 stuff so and CLI so that if you want to create documentation it's not just you create it but

38:22 here's the tools that the agents can use to work this to help you create it as well and it sounds

38:26 like you got something real similar we started with developer experience because the insight

38:32 I had was that the best developer tool this is from from reviewing thousands of tools with console.dev

38:38 but the best developer tools have the best developer experience so that means good documentation

38:42 I mean, it means proper comments that your editor can surface as the IntelliSense type

38:51 pop-ups that show up in your editor to give you hints by what you should implement,

38:55 creating examples.

38:56 And we started in 2023 before coding agents really became as dominant as they are now.

39:03 But what we created by accident was this idea of agent experience, because good developer

39:09 experience is very similar to what you need to do to have good agent experience and so today that

39:15 means we have skills you can install we've got an mcp server so you can query your data or your agent

39:21 can query your data and to help you understand what you need to do we've got a cli and we make

39:27 it super easy for you to just point your coding agent at our docs or our lms.txt and then it will

39:35 figure out what needs to be done.

39:37 Like if you scroll up a little bit, the CTA that we have on the homepage, the

39:41 first one is start with a prompt and you can go to the code if you want, but

39:45 install the skill and then you tell your agent what to do.

39:49 And where developers would often get stuck is you've done that initial installation or then what?

39:55 Because that's in your code, you have all the context of who the user is and

39:59 you can log rather than blocking.

40:01 And when you deploy, your agent can follow along, can follow the logs.

40:05 It can connect to Arcjet.

40:06 It can pull all the data out.

40:08 And then it can make suggestions to change the rules to avoid false positives and then

40:12 commit them for you.

40:13 And so what you get is this security partner that works with the Arcjet building blocks

40:18 and helps you solve these security problems.

40:20 I love it.

40:20 That's a really great onboarding experience.

40:23 I feel like there's this split of a lot of people having great experiences with AI agents

40:28 and a lot of people not having them.

40:30 And I know there's a lot going on there, but a lot of people not having great experiences,

40:34 I think there's just steps they're missing or techniques they're missing, or they're choosing a really, really free, simple model.

40:41 But things like this, like you just automatically run a single command to install the skills.

40:46 And now AI is already better with this.

40:48 It's really, really cool that you've thought through it.

40:50 Obviously, having the AI work within your system.

40:53 So when I'm using your SDK, is it actually doing AI at runtime or is AI sort of around the more static side?

41:02 We use AI for the prompt injection detection side of the product.

41:08 And that happens when you build in that particular building block into your code.

41:12 But we have other components.

41:14 We do PII detection, so we can detect email addresses, credit card numbers.

41:18 And that happens entirely locally.

41:20 We actually ship a WebAssembly sandbox with the SDK, which runs in process in your environment.

41:25 And so when someone submits their credit card number by accident into your support form,

41:30 we do that analysis entirely locally and it never leaves your environment.

41:34 But to do something like prompt injection detection, that's where we run a load of specialized models on our cloud.

41:41 And so the call goes out to us and then we're running AI behind the scenes.

41:45 The difference is just the kind of thing you want to protect against.

41:49 Like if you're doing a form submission, then you want us to respond within,

41:52 we set an SLA of 20 milliseconds to respond to those.

41:56 So you would never notice.

41:57 And if it's inside the WebAssembly sandbox, it's less than a millisecond.

42:01 But if you're going to an LLM, then it may take a couple of seconds to get a response.

42:05 And that's normal as expected.

42:06 So we have a little bit of leeway to do our own analysis.

42:10 Typically, that's around 200 milliseconds.

42:12 Yeah, if you add half a second to someone else's LLM call, it probably, you couldn't tell the difference.

42:18 It could just be, oh, well, the LLM's a little busy right now, right?

42:21 Yeah, very cool.

42:22 Okay, so why start this company? Why start ArcJet?

42:26 I started it because I felt that the developer tooling for security just wasn't there.

42:33 And we had gone through in the previous few years, kind of 2020 onwards, this revolution,

42:39 certainly in the JavaScript ecosystem. And then it started going over into

42:44 Python and into the other ecosystems as well, that you would get these amazing tools,

42:48 You get amazing cloud platforms.

42:49 You get amazing CLIs and tools for developers to write code.

42:53 But it was five or 10 years behind in security.

42:57 And I thought we could bring this new approach, the developer experience to security in your code.

43:03 Something that's actually built for developers to try and solve problems.

43:06 And I thought we could build out a platform company to solve all these different problems.

43:13 But developers need these building blocks.

43:15 And so that's why we've kind of split the product up into these different areas.

43:18 So you can come to us for one thing, like prompt injection detection, but you probably also want rate limiting to implement budget controls for your AI tool.

43:28 You want bot protection to stop automation attacks against your application.

43:32 You want PII detection.

43:34 So all of these things start to compose together.

43:36 You can use them individually or you can bring them into a multi-rule security policy.

43:43 But that's the reason for creating this.

43:44 Seems like a great idea to me.

43:46 So Startup Row, how'd you end up there?

43:49 Yeah, I was following the PyCon blog for a while because I knew we were going to do a Python SDK

43:57 and wanted to start getting into the community.

44:01 Python's actually the first programming language and I really learned and I've been writing it since 2008, 2009.

44:07 So back in the Python 2 era.

44:10 But we started out.

44:11 The two to three wars.

44:13 You're a veteran of the two to three wars.

44:14 Yeah, I know.

44:15 Luckily, I kind of stopped writing Python for a bit with my last startup.

44:20 And then when I came back to writing code, it had already gone on to the 3, version 3.

44:25 So I kind of skipped that pain.

44:29 But we started with JavaScript and we just had a JavaScript SDK because all the new applications were being built using JavaScript and TypeScript.

44:38 As AI started to come onto the scene properly kind of end of 2024 into 2025,

44:45 Python really started to shine again because it's been native in the data community,

44:51 the data science community for a long time.

44:53 And it's where all the AI work is happening.

44:56 And it started to be used for the AI backend APIs again.

45:00 And we just built our own backend.

45:03 I'm using the Open Inference Protocol and that allows us to run different models

45:07 but have an abstracted layer internally because our actual core infrastructure is written in Go.

45:14 That's what I moved to from Python to Go.

45:17 But we've decided we were going to release the Python SDK, which came out at the beginning of this year.

45:22 And that was just in time for the startup row application and to attend PyCon

45:28 and start getting it in the hands of Python developers.

45:31 Nice.

45:31 Is that a little bit like Open Router and those other things, this middle layer?

45:36 It's similar, yeah.

45:36 The challenge with the models is they all require different things.

45:40 They have different inputs.

45:42 And if you're going to run multiple models, which is what we do, we kind of run the prompt injection detection in parallel

45:48 across multiple models and then compare the results.

45:51 And for our backend to have a consistent API where we get the result out, we want to get the token count,

45:58 we want to monitor the latency, we created a thin layer that uses the open inference protocol,

46:05 which is what our Go code connects to.

46:07 And then that abstracts away the differences of the different models.

46:10 And that makes it a lot easier to just build our internal client server architecture.

46:15 Makes sense.

46:16 So how did Startup Row go?

46:17 Did you make some good connections?

46:19 Did you feel like it was worth going?

46:21 Yeah, it was great.

46:22 And it allowed us to get in front of the community with the support of the PyCon organizers.

46:29 It was interesting comparing to other conferences because to date, we've been mainly at JavaScript conferences.

46:35 And the type of questions that we get really shows the understanding of building AI workflows, building AI APIs, building the data models and the data workflows behind the front end applications, which are still typically built in JavaScript.

46:52 And just the questions are very different.

46:54 And you could utilize a different type of engineer at PyCon than you do a JavaScript conference.

47:00 Yeah, there's some really clever people there.

47:02 I always feel like I'm talking to some really smart people in the room when I'm interacting with the PyCon attendees.

47:07 So you talked a bit about the tech going on behind the scenes.

47:12 It's interesting that you have Go, I think, there.

47:15 It really seems like that's server-side type of programming first is what Go is for.

47:22 I think people think Go Rust is kind of similar, but Rust is a little more lower-level operating system type of thinking,

47:29 I think, whereas Go is all about async and programming.

47:33 What are your thoughts? I know you're not doing it, so there's no reason to switch over.

47:36 What are your thoughts about using some of the modern Python API things for your backend?

47:41 We use FastAPI and we use an ASGI app that's hosted on Modal for our AI inference.

47:48 Modal gives us the GPUs and the hosting environment auto scales.

47:52 And then our applications are just deployed in containers.

47:56 The Go API, the Python SDK that our customers use is Go because it's a gRPC API.

48:05 And Go has the best libraries available for gRPC because gRPC is written by Google and Go is written by Google.

48:12 So they have the best APIs.

48:14 I mean, yeah, they were built to go hand in hand.

48:16 I feel like gRPC never really spread beyond Go.

48:21 I mean, there's some options in Python to use it and some other languages, but it's kind of like,

48:25 It still feels proof of concept to me.

48:27 I don't know.

48:28 I've looked at gRPC as well.

48:29 I'm like, that'd be pretty neat, but no, just do JSON or something.

48:32 Yeah, we did it because of the performance.

48:35 So we talked a little bit about the latency.

48:37 And if we're sitting in your request path, we want to respond as quickly as we possibly can.

48:42 Like ignoring the prompt injection detection, which adds a couple of hundred milliseconds.

48:46 When we're doing bot detection, when we're doing rate limiting, we want to give you a response within just a couple of milliseconds.

48:51 And so when you install our Python SDK, it opens an HTTP2 socket that maintains persistent connection with our edge network.

48:59 And that removes some of the latency because you don't have to establish a connection every time.

49:05 But then we're making RPC calls using gRPC to minimize the latency because the wire protocol is binary.

49:12 So it doesn't have to do the heavy work that you do when you're parsing JSON.

49:17 and that just makes it a lot more performant and means that when the communication is happening we

49:23 can we have backwards compatibility with the protocol layer it just gives us a lot of these

49:27 nice properties and that allow us to have a stable but perform an api yeah it makes a lot of sense

49:33 i'd probably use msgspec these days if i was really really worried about packet i think gRPC is

49:39 also great but like if i was had to fit it into the python space i'd probably go msgspec

49:44 I think that's probably the tightest, most binary packing you can get these days.

49:48 Yeah, definitely.

49:48 There are some interesting options now, but we support multiple languages.

49:54 So we started with JavaScript.

49:56 There's some really good generators for gRPC in JavaScript.

50:00 And we've just done a Go SDK as well, and we're going to go to other languages.

50:05 So we looked at a few different options, and this was 2023, so it's a few years now.

50:12 You had a lot of time to think about it.

50:13 no one was going outside anymore.

50:15 What a weird time.

50:17 So give us, I want to maybe just get your perspective here since you've lived on both sides of these fences.

50:23 How is JavaScript, TypeScript security different than Python right now?

50:28 That is a good question.

50:30 I think PyPI hasn't had the same problems as npm has.

50:36 And it's probably a factor of its popularity with where all the attention is.

50:42 That's going to change because these package registries are really set up

50:46 almost as volunteer organizations.

50:49 And certainly PyPI is run by the Python Foundation.

50:52 So that is a kind of formal connection there.

50:55 But they don't get the funding of a megacorp that should be able to invest huge amounts in security.

51:02 Of course, npm is run by a megacorp.

51:04 It's run by GitHub, which is Microsoft.

51:06 And so they should have the resources to really improve security.

51:09 But that just hasn't...

51:12 Still, they're struggling under all the attacks.

51:13 They're struggling.

51:14 They just haven't allocated the resources it really deserves.

51:20 And this is just a problem with the architecture because they were built in an era

51:23 when this just wasn't an attack factor.

51:26 It was just so much trust around open source.

51:29 It's like, it's built for the good.

51:31 They put it out there for you to use.

51:32 It has a license, just install it.

51:34 Well, maybe that's not the only problem.

51:37 Yeah, that's right.

51:38 It's just they provide a very basic service.

51:39 It's basically just hosting your code in a certain way, but that's been abused.

51:43 And it's multiple orders of magnitude increase in traffic.

51:48 And that just means you're going to get bad actors and they're going to exploit it.

51:51 Yeah.

51:52 I honestly think that with all the agentic AI helping people write little bits of code here and there

51:57 and stuff like the security supply chain thing that we're talking about,

52:01 there's going to be a lot more vendoring.

52:02 And if you only need one or two functions out of a library, maybe you could just have Cloud write it

52:06 and you don't have to depend on anything and worry about it.

52:09 yeah it's we'll see where it goes in the next five years agreed yeah that's why google's

52:13 always run things they always vendor all their dependencies and now i think if it's just a basic

52:18 dependency that's not really doing that much you can get your coding agent to do it for you

52:23 i one of the things that i always talk to our customers about is that well you can write your

52:28 own frame limiting library you can do that pretty easily and use redis but can you build your own

52:32 bot detection and deal with all the kind of the arms race of keeping up with automation that's

52:39 where we think we add value and i think that's the way we're going to see what dependencies is most

52:43 of them you're going to write yourself or get your ai to do it but there's going to be a few key

52:47 things that really you shouldn't be doing yourself like crypto cryptography not cryptocurrency

52:53 that's going to be the that's going to be the questions like what is the crypto equivalent

52:57 yep 100 if you see people doing 100 from scratch their own crypto that's pretty much a

53:02 very big one, a red flag. So speaking of, you know, what people are going to be doing,

53:07 what's the roadmap? Let's close it out with the roadmap for Arcjet. Where are things going from

53:11 what you have now? We're going to add more language support. And it's not just a case of

53:16 code genning a library because we have the WebAssembly component in there that does a lot of

53:21 the security analysis in your environment. And that requires a bit of engineering work for each

53:27 platform. In JavaScript, WebAssembly is native within Node, within the runtime, so we have to

53:33 write the bindings, but it's relatively straightforward. In Python, we use Wasmtime,

53:38 which is an open source runtime for Python. And in Go, we use one called wazero, but different

53:47 ecosystems have different packages because often WebAssembly is not natively built into the runtime.

53:53 And so that's where a lot of the work goes, is making sure that WebAssembly works in a way that is performant and native.

54:00 So we're going to add more language support.

54:02 That's the main thing.

54:03 Okay.

54:04 Yeah, that's, I mean, got a solid foundation.

54:06 Now you just want to make it available to all the different users and developers of different platforms.

54:12 You know, it would be pretty interesting.

54:14 In Python, we've got the CFFI and the C bindings and all these different interop things and Rust through PyO3.

54:21 It'd be kind of interesting to just have a native execution of Wasm in the runtime without

54:26 some kind of adaptive layer type of thing.

54:28 I think that would be cool.

54:29 Definitely, because I want to have absolutely nothing to do with C in a security focused

54:35 application.

54:35 I can imagine that.

54:37 Yeah, definitely.

54:38 It should be built into the runtime, but then we had the problem of minimum Python versions

54:43 and all those kinds of things.

54:44 And I think Python 3 people do typically stay more up to date, but it's still a problem when

54:50 you require people to be on the very latest version of the runtime.

54:54 Yeah, 100%.

54:55 A lot of people just take what comes with Ubuntu or something, which is usually a year behind.

54:59 All right, David, I appreciate you taking this time to chat with us.

55:03 Congratulations on ArcJet and good luck for the future.

55:06 Great.

55:06 Thanks a lot.

55:07 Yeah.

55:07 Bye.

55:09 Our next startup is femoral.dev.

55:13 And we're going to be talking to Chinmaya.

55:16 Chinmaya, welcome to Talk Python.

55:18 Excellent to have you here.

55:18 Hey, super nice to be here.

55:20 Fun times hanging out in PyCon, meeting up on Startup Row.

55:23 That was a really cool setup there.

55:26 Yeah, it was really good.

55:26 It was a pretty awesome opportunity to be there.

55:28 So glad we made that happen.

55:31 I can imagine.

55:32 I was talking to one of the organizers and the success rate of startups on Startup Row

55:37 is really high.

55:38 Hoping to follow suit.

55:38 I hope for it.

55:40 Exactly.

55:41 Let's keep it rolling, right?

55:42 Well, let's actually begin by having you do a quick introduction for the listeners.

55:46 Sure.

55:47 Yeah.

55:48 I'm Chinmaya.

55:49 I'm the founder of Femoral.

55:50 We're like a hosting platform for Python-based web apps.

55:55 And yeah, just excited to be here.

55:58 Awesome.

55:59 I think hosting's been around forever, right?

56:02 Like cPanel land and all that weird stuff.

56:04 But it seems like there's a lot of opportunity to create better opportunities, better developer

56:09 user experiences, and also cost, right?

56:12 Yeah, that's kind of our angle, right?

56:15 is like, obviously you can go and spin up a server.

56:18 Pretty much every company under the sun offers that service.

56:22 But, you know, we really want to focus on the developer experience, making it as seamless as possible to actually get your code

56:28 hosted into a place where it's live and without having to, you know, deal with the manual steps

56:35 and extra problem surface area that come with, you know, potentially managing your own hosting.

56:41 And like you mentioned, the cost, that's also a factor for us because, you know, especially when you have persistently provisioned compute, you really

56:48 have to size and scale your servers such that they don't, you know, break your wallet, break

56:54 your bank account.

56:55 And so you're kind of dealing with two problems, right?

56:58 It's like, is my server or set of servers actually going to be able to handle all of this traffic?

57:03 And at the same time, am I going to be able to pay for all of this?

57:06 Yeah, that's both the promise and the potential downfall of the cloud is it's easy to set up

57:12 a few things.

57:13 they won't cost very much unless they take off and then maybe they do.

57:15 Yeah.

57:16 And then, you know, you're spending all this dev time, you know, trying to manage all that

57:20 infra on top of that.

57:21 So, you know, if you equate, you know, potentially your own developer time or your team's developer

57:26 time to money, you know, that that's another cost that you have to think about.

57:29 What about AI and specifically vibe coding?

57:32 I'm not a huge fan of vibe coding as a concept, although it's super fun to just come up with

57:37 a crazy idea and send the agent off and go, I wonder what it does.

57:40 Right.

57:40 But as a form of engineering, I'm not a huge fan of it, but I do think that we're in an emerging world where there's many people who have programmed in a sense and they have an app and then they're just completely like, well, how do I get it on the internet?

57:55 You know, because they're not really developers, right?

57:57 And there's like, I don't Linux, I don't DevOps, but I still need it on the internet.

58:02 How do I go about that?

58:03 Right?

58:04 What do you think?

58:04 Yeah, I mean, that's, you know, a massive cohort that's, you know, kind of emerging into the new

58:11 space of builders, right? And, you know, they want to host their stuff in a place where it's live,

58:16 and they don't have to, you know, deal with the extra, you know, headache of figuring out,

58:21 like you mentioned, what is all this Linux stuff? They kind of just want to be like, hey, I vibe

58:25 coded this Flask app, this Django app, this FastAPI app, and it's really cool. And I share with all

58:29 my friends, how do I do that? Right? Where do I even start? Like, what is a server?

58:34 These types of questions are really a big blocker, right?

58:38 And that we do serve those people as well, quite well.

58:42 Yeah, I don't want to position you initially as only serving VibeCoders.

58:45 It just, it seems to me like that's a whole nother area of people who really could be

58:49 served from a simplified deployment experience.

58:51 But there's others, you know, I'm thinking data science, honestly.

58:55 There's a ton of data scientists who have built something amazing, but they're not really

59:00 DevOps software engineer side.

59:02 They come from more from the science side and they've done really cool work, but it's a big step to go from I've got my notebooks running to now I have a FastAPI app to now I have it on Linux.

59:15 Yeah, no, there's a ton of like beginner cohorts, I would say, that, you know, are served really well.

59:22 For us, we're like mainly focused on, I would say, two main cohorts of people that we see as our ICP.

59:29 One is like smaller development and startup teams, right, that have a Python stack.

59:34 And, you know, they want to spend most of their time in the business logic and actually

59:38 serving their customers' needs and, you know, focusing on their customer feedback rather

59:42 than managing their internal hosting solutions.

59:45 So they, you know, wouldn't have to, you know, hire someone to do that or spend their dev

59:48 time, you know, on those tasks.

59:51 And then the other cohort are, you know, agencies and consultants, kind of by the same token,

59:56 but they're served a little differently by one of our core features, which we kind of haven't talked about yet, which is the managed cloud.

01:00:04 Yeah, tell us.

01:00:05 Let's take a step back and maybe position Femoral around.

01:00:09 On one hand, we have infrastructure to the service, EC2, DigitalOcean, Hetzner, VPCs, whatever.

01:00:14 And on the other, maybe you just upload some code somewhere.

01:00:18 Where do you all live?

01:00:21 I think I may have missed a couple words in there because of internet, But I am going to say we fall kind of in the platform as a service side of things where you want a place to host your apps and you just want to be able to throw them up there without having to deal with these servers manually.

01:00:42 Yeah, exactly.

01:00:43 You got the gist of it.

01:00:44 So basically, it's sort of a continuous deployment type thing.

01:00:47 Create a certain branch, like a production branch or whatever.

01:00:50 Connect GitHub to it.

01:00:51 Push that branch.

01:00:52 Off you go.

01:00:52 Like merge a PR and it's live, right?

01:00:54 Yeah, pretty much. And then all of your code, without having to dockerize or containerize,

01:00:59 because of that build pipeline we built, will get built and will then deploy it onto our managed

01:01:04 cloud, which uses our serverless orchestrator to scale up and scale down and manage the amount of

01:01:12 instances slash containers slash VMs that are running your code relative to the amount of

01:01:18 traffic that's coming and the amount of resources that your code base is using at any given moment

01:01:23 will be properly provisioned to match that traffic.

01:01:26 So that's kind of like what we do in an end-to-end sense.

01:01:30 Code uploaded, we build, we deploy, then we scale.

01:01:33 So those are all the things we take care of.

01:01:36 Yeah, awesome.

01:01:37 I see it says now in early access.

01:01:39 How do I get early access?

01:01:41 Yeah, you just click that button that says get started and you sign up.

01:01:45 Yeah, there's no, really, there's no gating here.

01:01:49 Yeah, got it.

01:01:50 I have talked to some folks who are like, yeah, not yet, but you can apply and hopefully, so we'll see.

01:01:55 Yeah, we're wide open.

01:01:57 We would love for anyone who has something they'd like to deploy to come check it out.

01:02:01 Sweet.

01:02:02 Startup Row.

01:02:03 What do you think?

01:02:03 How'd you get there?

01:02:04 Why do you put the time and energy into that?

01:02:06 Yeah, Startup Row, it's pretty awesome.

01:02:08 You know, I was in contact with some of the organizers and, you know, I don't think I really

01:02:16 had a strong sense of what Startup Row would be like before I actually got there.

01:02:21 given that it was also my first PyCon at the same time.

01:02:23 So there's a lot of new happening.

01:02:26 But I would say I did it, you know, to just kind of get more involved in the community, right?

01:02:33 Like as the company were, you know, made for Python dev.

01:02:35 So naturally I want to be, you know, involved in the community.

01:02:39 And then once we got there, it was pretty spectacular, you know, got to meet and talk to like so many developers

01:02:46 and people with different problems.

01:02:48 And, you know, people would come up and say, oh my God, this is something that I really need right now.

01:02:52 And then you'd have people that come up and be like, this is so foreign to me.

01:02:56 I don't even understand why I would need this.

01:02:57 So there's like, you just get the whole spectrum and it gives you a really good perspective.

01:03:02 That's cool.

01:03:03 I'm glad you had a good experience.

01:03:04 I don't go to PyCon every single year, but when I do, I always really enjoy being there.

01:03:09 It's great to just be with people in the community doing really smart and interesting things.

01:03:13 Yeah, no, I learned a lot more about the community than I knew before I got there.

01:03:19 Yeah. I think it's also, as a startup, it must be interesting to talk to people and go,

01:03:22 I don't think I need you. What is this? Why do I need this? Because either it helps you focus in

01:03:26 on your ideal customer and say you're actually outside of that slice of the world, or position

01:03:31 your messaging a little bit, right? Like one of the big stories of startups is talk to customers,

01:03:36 talk to customers, like so much easier said than done though. Yeah, no, definitely. You're right

01:03:40 about that. I would say we were able to validate our ICP because the most interest we got was from

01:03:47 smaller startup teams and from agencies slash freelancers or consultants that were, you know,

01:03:53 dealing with clients and had all these staging environments that they had to manage and all this

01:03:56 extra infra. So like kind of validated that in a sense, but, you know, I guess at the same time

01:04:02 invalidated outside of our ICP simultaneously because of those conversations I talked about

01:04:06 earlier. I hadn't really thought about the agency side, but a lot of what happens with the agency is

01:04:12 somebody comes in and builds something really nice and they get it working and then they hand it off

01:04:15 to somebody who's presumably not quite as familiar with the tech because otherwise they might have just built it themselves.

01:04:21 So the concept of, look, you just pushed to this branch.

01:04:24 Yeah.

01:04:24 That's got to go over well as a handoff.

01:04:26 Right. It's great for a handoff.

01:04:28 And it's also great for managing, you know, like mock-ups and staging environments, right?

01:04:32 Like say you have, you know, a number of clients and you send them a mock-up or two or a live demo every week.

01:04:39 And then you have to manage kind of this extra staging environment for every single one of these clients.

01:04:44 it becomes a lot more infrastructure work for you as a company compared to like, say,

01:04:49 you're just a startup and you just have one product and, you know, you need one staging

01:04:52 environment for that one product. Right. So like for agencies, this kind of infowork can kind of

01:04:57 increase pretty rapidly as the clients increase per client almost. Yeah. Right. Interesting. And

01:05:01 then the scaling, right. Between each client. Right. So there's like, you know, I could get into it,

01:05:05 but we only have so much time. Indeed. So how did you sort of told me how your experience was,

01:05:11 but as a startup, how was startup row for you?

01:05:14 Yeah, I mean, it was great.

01:05:16 We, you know, we've made some really good connections for potential partners going forward, potential customers.

01:05:23 So I would say it really helped grow the business and gave us, you know, like, I guess I already said this,

01:05:28 but gave us perspective on like what certain, you know, personas perceive of the product,

01:05:34 you know, given their context and their background.

01:05:36 I found it very interesting to just walk around the expo floor and kind of just get a sense of what have people come to promote, right?

01:05:43 You get one time a year to come talk about something and what is their, what does their banner say?

01:05:48 Who decided to come?

01:05:49 Who decided not to come?

01:05:50 And yeah, it's really, it's a neat experience.

01:05:54 Definitely.

01:05:55 Tech, how much do you want to talk about tech behind Femoral?

01:05:58 I think it'd be interesting, but I don't want to make you give away secrets.

01:06:02 So tell us what you're comfortable sharing.

01:06:04 Yeah, it's fun.

01:06:05 So I guess I can kind of talk about the functional experience of what the tech enables, which is that, you know, we built this build pipeline.

01:06:18 And essentially what we're doing, in a sense, is, you know, at a high level, going through your code base and figuring out like, okay, what are the tools you're using?

01:06:25 What are the libraries you're using?

01:06:26 What do we need to package?

01:06:27 What do we need to build?

01:06:29 And, you know, like that's quite straightforward, right?

01:06:31 You're just kind of parsing code.

01:06:33 It's largely a solved problem.

01:06:35 But then, you know, I guess the more technically interesting side of things is the like serverless orchestration that we've written in the managed cloud.

01:06:42 And so that's kind of based off of, you know, these like fast starting VMs that we use and some kind of, you know, way to take advantage of, you know, really fast cold starts with those fast starting VMs.

01:06:56 Right. So like, you know, when someone does have something deployed serverlessly and it is scaled to zero, well, when they hit it, they don't want to be sitting there for a minute.

01:07:03 right waiting for some instance to spin up right so like a really interesting cost level right

01:07:07 because if you want to keep one running so it's not a cold start but that that that costs money

01:07:12 but if you want to turn it off maybe it's well i don't want to wait 30 seconds for the first request

01:07:16 right right and that's like really uh what we wanted to avoid and what generally happens with

01:07:21 persistent compute so that's like one of the reasons that we as a company decided to build

01:07:25 on serverless compute um in order to like you know drive this experience where you can have all these

01:07:30 things deploy and they've all scaled to zero, but you're not, you know, waiting on a cold start when

01:07:33 you do actually want to use something. And that's kind of powered by these fast starting VMs that

01:07:38 we've kind of built everything on top of. Okay. What can I host there? Look at your website. It

01:07:44 sounds like the standard Python stack sort of thing. Yeah. It's like, I mean, that's kind of

01:07:49 like just the biggest subset of the set of things we can deploy on Femoral. Mostly it's anything

01:07:55 written in python exposed with an api and that you wouldn't need gpu compute for because right now we

01:08:01 don't have for gpu compute for many reasons that that's gpu is really interesting it's as soon as

01:08:07 you go down that path it's like well you could start at a thousand dollars and go up from there

01:08:11 rather than ten you know it's like yeah it's tough to offer a reasonable price point with gpus yeah

01:08:17 i'm sure that it is um so one thing i see the web frameworks and they all look real common of

01:08:23 Of course, if you can do all those three, then you can do others, I imagine.

01:08:27 What's the story for data access?

01:08:30 Databases, Postgres, SQLite, Mongo, others?

01:08:32 Yeah, so I guess this kind of feeds into our roadmap, which is we're now working on building serverless Postgres,

01:08:40 like managed serverless Postgres, to add right into Femoral.

01:08:44 So you deploy your serverless web app, and right next to it, you deploy your serverless Postgres.

01:08:49 They talk to each other.

01:08:50 It's all in the same platform.

01:08:51 Everything's managed under the same billing.

01:08:53 So it's really straightforward.

01:08:54 So yeah, right now, if you were to deploy in Femoral, your database would live outside of our compute.

01:09:00 But we are very soon bringing that into the scope of what Femoral does.

01:09:04 Got it.

01:09:04 Do I get to pick my cloud?

01:09:06 Do I say EC2 or like, sorry, AWS or Azure or DigitalOcean or whatever?

01:09:12 Do I choose that then pick up a managed database in that location?

01:09:16 Or how do I relate those two things?

01:09:18 Yeah, so you can choose to locate your database.

01:09:22 like when our database managed progress is offered, you can choose to locate that in whichever region,

01:09:29 basically any AWS region. Although there are obviously other hyperscalers that we could offer

01:09:34 that in right now. And in terms of the actual web app compute itself, that is not necessarily

01:09:42 provisioned into a specific region. But we are working on how we might communicate that to the

01:09:50 user. Sounds good. And I guess anything else on the roadmap that you want to touch on, or is that

01:09:55 pretty much covered the database? Yeah. I mean, basically, right. You have the stateless compute,

01:10:00 and then obviously you need some states somewhere, right? So that's all we're working on, right,

01:10:03 is the Postgres. People want, obviously, Redis, some kind of key value. So managed Redis will

01:10:10 probably come with Postgres. Yeah. Chinmaya, congrats on starting Femoral and best of luck

01:10:16 to you guys and nice to meet you at PyCon. Thanks for coming on the show. Yeah, for sure. Nice

01:10:20 meeting you too. See you later. Now we meet Capicio in Beyond, its co-founder. Hey, Beyond,

01:10:27 welcome to Talk Python To Me. Yeah, thank you so much, Michael. Really great to be on the show.

01:10:31 Yeah, it's fabulous to have you be part of our startup row segment, stroll down startup lane,

01:10:37 if you will. I think it's such a neat idea that PyCon, the PyCon organizers, Jason and Shea and

01:10:43 everyone else put that together. And so it's just a neat look into what's up and coming in the Python

01:10:48 space. So we're going to talk about Capicio. Capicio. Yeah. Capicio. I love it. This is your

01:10:55 business. And we're going to dive into why you want Startup Row and all those things. But before we do,

01:11:00 quick introduction. Who are you? Yeah, sure. Thank you. So my name is Beyond Denote. I am the founder

01:11:05 of Capicio, we refer to ourselves as the authority layer for AI agents. So identity policy, basically

01:11:14 to be able to secure agentic environments so that they're ready for production and your

01:11:20 SecOps teams aren't freaking out when you're trying to deploy agents to prod.

01:11:25 It's really tricky, right? We've got this non-deterministic thing running, but it's supposed

01:11:30 to be operating in a professional or legally structured environment, right?

01:11:36 Yeah.

01:11:36 It's a massive challenge.

01:11:38 It's blocking a lot of POCs right in its tracks at the moment, or even some organizations

01:11:43 from not even considering AI solutions.

01:11:46 So we're trying to bridge that gap and solve that problem.

01:11:49 You know, I'm reminded of some stat that was probably just made up or whatever, or very

01:11:55 vague, like 97% of all AI projects fail.

01:11:59 And I have no idea what that even means.

01:12:02 Does that mean like they tried to add AI to an email client and they just, it couldn't

01:12:06 be added?

01:12:07 Like, no, my email client just has a very bad representation of AI in it.

01:12:11 Yeah, I have no idea.

01:12:13 Yeah, no, no.

01:12:14 I've seen that stat as well.

01:12:16 I think the source was some MIT paper that ended up maybe not being quite as well researched.

01:12:22 So I don't know what the real stat is.

01:12:23 Maybe we're somewhere close, but yeah.

01:12:26 Yeah, I think it's a much higher level of success.

01:12:28 but things like what you all are building are sort of key to that success, right?

01:12:33 Yeah, absolutely. It's a whole new ecosystem. I mean, if we go back to the early 2000s,

01:12:38 we invented this thing called the internet and we had all these great ideas, but there was a lot

01:12:44 of infrastructure still missing to be able to realize that. And at the time we didn't really

01:12:49 know what infrastructure we really needed yet. With the whole agentic era, I think we've hit a reset

01:12:55 button on a lot of that except we're a lot more technologically mature so i think we teams know

01:13:01 what they need and what they're missing and we're in this build race to to be able to close those

01:13:06 gaps interesting i do think it's just absolutely the wild wild west is not the right term but it's

01:13:11 just an unknown we don't know what we need we don't even necessarily to some degree know what

01:13:16 we don't know for some of these products but it's changing fast i was sort of thinking about that

01:13:22 that one when i threw out when your response to that that stat i threw out is like how much of

01:13:26 that is based on 2024 ai you know what i mean right it moves so fast i mean even six months ago and

01:13:33 you're looking at a different beast you know it's uh it's incredible the iteration pace is so fast

01:13:39 yeah so why'd you start Capicio yes uh good question so um i obviously had been following

01:13:45 the ai space um you know jumping on everything that that i could to be able to learn more and get

01:13:51 into the space.

01:13:52 But I wanted to do something meaningful.

01:13:54 I wanted something infrastructure level.

01:13:56 And then when Google released the first iteration of the A2A protocol, the agent to agent protocol,

01:14:03 I think it was somewhere around March last year, March 2025, a few light bulbs started going off.

01:14:10 Because here we were creating a new standard for agents not to communicate with MCP servers or human agent.

01:14:19 This was agent to agent communication, which we knew was coming, but the concept blew my mind.

01:14:27 And then looking at DeepMind, Google DeepMind also released a few papers on intelligent delegation,

01:14:34 agent to agent delegation.

01:14:35 So looking at that, from where we were, we're just using basically LLMs and chatbots

01:14:42 to a layer where we had agents self-orchestrating and trying to do that at scale.

01:14:48 That's where, like I explained, early 2000s internet, we're missing a lot of infrastructure.

01:14:53 And for now, being able to see that realization for that vision, there's a lot of infrastructure missing.

01:14:58 So that's where I started building, particularly around the trust layer,

01:15:04 the authority layer, trying to make sure that agents can communicate securely, safely together,

01:15:11 and that we can facilitate that with all the missing layers that we have right now.

01:15:15 Well, it's definitely a challenging problem, right?

01:15:17 We used to write tests or have security scanners, but now, what are some of the new challenges,

01:15:23 I guess, that you see out there that are different than prior stages?

01:15:28 Yeah, good question.

01:15:30 So one of the challenges has to do with identity.

01:15:35 When I started working on the problem, I was focused more on setting guardrails for AI

01:15:40 agents, helping them to communicate together, being able to learn more about each other,

01:15:46 But being able to grant every agent an identity was very important.

01:15:52 Right now, it seems like the current trend is that we're trying to expand the known models around IAM into agentic identity, which is sort of a band-aid right now.

01:16:04 But one of the biggest restrictions is that those are usually within a singular domain.

01:16:10 And cross-domain open web communication, being able to extend that identity concept to really the future that we have with agentic AI on the open web, that becomes a big challenge.

01:16:23 And then along with that, of course, we have what we're referring to as the trust layer.

01:16:31 Just like in human society, we have people we interact with.

01:16:35 And the first thing we even do before we buy a new pair of shoes or a restaurant is we check reviews, you know, on the merchant, on the actual product.

01:16:45 And that mechanism doesn't really exist for agents right now.

01:16:49 So in a way, there's some of those primitives that are missing to be able to really scale that Agentech future up to full speed.

01:16:58 Going to be weird, right?

01:16:59 Yeah, absolutely.

01:17:00 All these things, you'll have to just drop in and check on your team of agents and see how things are going.

01:17:06 You know what?

01:17:07 I can easily see a product manager or some kind of manager agent that you go and work with.

01:17:15 And then it's like, look, how's the whole thing going?

01:17:18 How's the team looking?

01:17:20 And then you scope it out.

01:17:22 You all go work on this.

01:17:23 Get back to me later.

01:17:24 It's going to be weird.

01:17:26 Yeah, we're definitely heading to that future.

01:17:28 I mean, we've maybe seen it in movies before, and it seemed like so far off, you know, but we're all definitely going to have our personal agent that just kind of knows what's going on.

01:17:41 And I don't think it's as far off as what a lot of us are imagining.

01:17:45 No, I don't either. It's going to be odd.

01:17:48 Okay, so Startup Row, that's where we met.

01:17:51 And while you all were at PyCon, why apply to Startup Row?

01:17:55 How'd you get there?

01:17:55 Yeah, no, thanks.

01:17:56 Good question.

01:17:58 So I have to be honest here because like a lot of folks who are now using Python and fluent in Python, I sort of backed into the Python community because around the rise of AI, obviously Python, I mean, we can refer to it as the native language for AI pretty much, right?

01:18:16 So started using a lot of Python each and every day.

01:18:22 And then this product that we built obviously has the SDK.

01:18:24 It has an MCP plugin for MCP server harnesses as well.

01:18:31 And then one day I got pinged by Jason.

01:18:34 You know, Jason and Shay organizing Startup Row.

01:18:37 And he gets pinged and he's like, hey, man, have you heard about Startup Row on PyCon?

01:18:41 I think you should really apply.

01:18:43 And I was like, wow, okay.

01:18:45 Checked it out.

01:18:46 learn about PyCon I'd never been before and Startup Row and it just seemed like a great

01:18:52 opportunity so I put the application in but like a lot of things you put in the application like

01:18:57 whatever you know let's see what comes of it and yeah it was a little while later got pinged to say

01:19:03 congratulations so that's how we landed on Startup Row at PyCon and man what a what a gift

01:19:10 what a privilege it really was a it was an awesome experience. Was that your first PyCon attendance?

01:19:15 It was. It was my first PyCon. I've been to a lot of other conferences. Even my partner who was with me at the event, he was just saying how he's worked so many other conferences and the vibe was just so different.

01:19:33 You know, when you're talking to people at PyCon, everybody was chill.

01:19:37 You were talking builder.

01:19:38 It was builder to builder conversations.

01:19:40 Hey, you know, what are you working on?

01:19:42 How are you solving this problem?

01:19:45 And there were just a lot of real conversations.

01:19:48 The vibe was great.

01:19:49 I really enjoyed it.

01:19:50 Yeah, I've been to a lot of different technology conferences as well.

01:19:53 And it is certainly unique.

01:19:55 And I mean that in a good way.

01:19:56 Yeah, yeah, no, absolutely.

01:19:58 So one of the things I think is really critical for startups is getting feedback,

01:20:02 talking to potential or active customers. And the advice is, well, you really got to talk to

01:20:07 customers. It's so hard to do. But on Startup Row, you get to kind of iterate on that over and over.

01:20:12 Did you get some good feedback? Yeah, absolutely. We weren't sure how many...

01:20:18 This was sort of a big validation test for us because we were curious about the timing of the

01:20:22 market. Is the market ready to have this conversation? And it was interesting from

01:20:28 developers point of view most to be honest were like oh yeah we got to do something about this we

01:20:33 haven't kind of got there yet um but then from from the folks who were working inside larger

01:20:38 enterprises or the ones who have very advanced solutions they were already at the point where

01:20:43 they they totally got it like wow okay this is awesome we need to check this out um so i mean

01:20:49 we're following up now we got a quite a few design partner opportunities which is the point that we're

01:20:53 we're at right now we're looking to work with teams who are experiencing this problem

01:20:57 And, you know, where they can start running our product and find where we need to extend it to make it work for them.

01:21:03 What about the tech?

01:21:04 Are you giving us a peek inside what core technologies you all are using?

01:21:09 Yeah, sure.

01:21:10 Absolutely.

01:21:11 So because it's an infrastructure project and there's cryptographic verification of identities, a lot of handshakes like that going on.

01:21:20 We've actually built the core of it in Go.

01:21:24 It's a Golang, you know, library.

01:21:26 And then we have all the wrappers and all the stuff that would benefit from native latency in whatever target SDK language.

01:21:36 So Python being the main one, that's the only one we have right now.

01:21:42 So basically, what ends up happening, the flow is you can use our SDK, one line of code.

01:21:47 We literally, that's why we had it on our banner at our booth.

01:21:52 one line of code to be able to connect, register an identity for that agent for an MCP server.

01:21:58 That's then cryptographically verified because the private key is stored right on the agent,

01:22:03 right on the server. And then with that identifier, you can now be able to create policy

01:22:09 around the tools and around the agent. What tools is it allowed to invoke? What are blocked?

01:22:17 what group policy, what organizational policy belongs to this agent or server.

01:22:23 And the neat part in how we designed it is those are all policies that are compiled into OPA bundles

01:22:29 and are actually cached right to the agent or right to the server.

01:22:33 So to be able to respond to requests is super fast.

01:22:37 It's a sub 10 millisecond because usually when we use security, we think, oh, well, there goes my latency.

01:22:43 So that was a problem we wanted to solve.

01:22:45 But that in essence is as far as we've gotten in our roadmap right now, we are working on a full public RFC stack.

01:22:54 So our next two RFCs, we're working on the intent layer, which is a whole other fun conversation.

01:23:00 But we're determined to try and crack that as well.

01:23:03 Yeah, yeah. Very cool.

01:23:04 You're not the only startup on Startup Row building the core server side components with Go.

01:23:09 Yes. And that's what we found out.

01:23:11 You know, when we were coming to PyCon, thinking Python community, we were like, okay, don't talk about Go.

01:23:18 This is Python.

01:23:20 Don't say anything.

01:23:22 And then we got some questions like you just asked, you know, were they really pressed?

01:23:26 And they were like, okay, no, that makes sense.

01:23:28 We get it.

01:23:29 That's like, you're like this product or that product, that product that uses a similar pattern, you know?

01:23:33 So, yeah, it wasn't an unusual way to implement it.

01:23:37 Python has a really interesting performance story and relationship with other native build

01:23:43 technologies or native languages.

01:23:45 Like, for example, Python itself runs on CPython, which is primarily implemented in C, not Python.

01:23:51 A lot of the ways that Python is made fast these days is through Rust.

01:23:55 So when you end up at a performance critical section, it's like there's a layer of Python

01:23:59 and then it's Rust, right?

01:24:00 Like my web server, the application server for my website and my course website, all the

01:24:06 things.

01:24:06 It's built in Rust and then it just delegates out to Python, out to Flask, basically.

01:24:12 So I think Python people are more than other technologies used to.

01:24:17 So there's this core of something and then this sort of mix of choose the right thing at the right place.

01:24:23 Yeah, yeah, no, absolutely.

01:24:25 And Rust was a consideration for me.

01:24:28 I came from a C background and Go just really intrigued me.

01:24:32 So we ended up building it and Go in and it's been a good result.

01:24:37 But yeah, I mean, especially when you're doing things like cryptographic verification and you're doing a lot of stuff over again, you wanting to scale to support many different languages, it does make sense to choose a core and then build around that.

01:24:53 But we're constantly tweaking, trying to make it still a really native feel for whatever target platforming, including Python, so that you can use all your customary tools, have a similar experience in your IDE, and now be able to have a way also that whatever agent you're building with, that they understand the structure and the semantics as well.

01:25:16 One more question before I let you go.

01:25:18 Oh, I imagine there's also a lot of Docker in there, right?

01:25:21 Yeah, absolutely.

01:25:22 So that was part of our stack as well.

01:25:26 One of the main reasons we introduced Docker was because a lot of enterprises, when setting up a tool like this, they'll test it on your cloud instance and be like, oh, yeah, that's great.

01:25:38 But can we put this in our private cloud?

01:25:40 Can we make it air-gapped?

01:25:41 I mean, I'm hearing a lot of stories about air-gapped AI solutions and whether they actually have a future or not.

01:25:47 But either way, we implemented around Docker so that it's an easy pull and set up to be able to run it.

01:25:55 I do think that there's, it maybe won't be the most common way, but I think there's probably some juice in the air-gapped AI.

01:26:02 Because with the air-gapped AI, you're going to need your own local models.

01:26:05 But, you know, for example, a PyCon and an NVIDIA plus Anaconda combo, they had their little AI cubes, you know, their sparks doing a really cool live AI demo.

01:26:17 And if you were a big organization, you needed, you really needed privacy.

01:26:21 I could see buying a 10 GPU cluster and just setting that up and make it go locally.

01:26:27 Yeah, it was interesting how many devs we spoke to who are working in highly sensitive environments.

01:26:34 There was even one developer who works with the voice and data control recorders for airplanes.

01:26:44 Obviously, situations like that, they very hesitantly introduce AI.

01:26:49 And if they do, it has to absolutely be very air gapped and sandboxed.

01:26:54 So, yeah, like you said, there's some applications where that is still a requirement, even though for most of us who have been on the cloud for however long feel it's a bygone era, but it really is not.

01:27:04 Yeah. And if I do, I'm not a believer of, oh, the AI bubble is going to pop any day now. But I do think if the pricing changes, all of a sudden, maybe buying a $5,000 machine is the economical choice in the long term. So in which case, that also comes back to that.

01:27:21 Yeah, yeah. And that's an interesting concept too, because obviously what we're paying today, even though we may bulk at it, it is heavily subsidized. We all know that. So either the subsidies run out and hopefully compute has a lot cheaper so that we can keep running or, you know, we have to make some tough choices about what we want to invest in and what projects are worth it. Or like you say, investing in our own hardware to run our own local LLMs.

01:27:46 Yeah, we don't have time to go super down this, but I do think that this extreme load on compute and kind of the subsidizing as well is a forcing function for creating much more efficient compute and forcing the models to execute more efficiently.

01:28:01 And I can certainly see a way where we just innovate our way through to where the actual price falls back to close to what we're paying.

01:28:09 As a data point, the latest NVIDIA H200, I don't know what it's called, H200 or whatever, is 10 times more efficient at difference than the one before.

01:28:17 By what, right?

01:28:20 So you do that a few times, all of a sudden, there's not a bubble.

01:28:23 It's back to normal.

01:28:24 Absolutely.

01:28:24 I mean, we've seen it before in many iterations, right?

01:28:27 Like the need facilitates innovation.

01:28:30 And that's probably where we're going to be now.

01:28:31 And we're facing, I mean, the concept of world models and other stuff happening to quantum computing, which I don't know what everybody says, five years, 10 years, whoever.

01:28:42 But I mean, all that is going to change the game.

01:28:44 So we'll see how it all comes together.

01:28:46 Beyond, thanks for sharing your thoughts.

01:28:48 Congrats on Capicio.

01:28:49 Thank you.

01:28:50 Appreciate that, Michael.

01:28:51 Yeah, thank you so much.

01:28:52 Bye-bye.

01:28:52 Take care.

01:28:54 Our final startup is Pixeltable, started by Marcel.

01:28:58 Marcel, welcome to Talk Python To Me.

01:29:00 Nice to see you.

01:29:01 Michael, thank you for having me.

01:29:03 Great to have you on the show.

01:29:05 Very cool that you all were part of Startup Row, and I'm excited to hear about Pixeltable,

01:29:10 your new company that you're getting going, which is exciting.

01:29:13 Before we dive into all that, though, you've got quite a history in the Python space.

01:29:17 Tell people who you are and what you've been up to.

01:29:20 Yeah, happy to.

01:29:21 So my background is really in database internals.

01:29:24 I did that at grad school a long time ago.

01:29:27 I worked at a bunch of database startups in the early 2000s.

01:29:31 I was at Google from '03 to '10, also working on scalable data infrastructure.

01:29:38 And toward the tail end again on a new internal database system, what we called hybrid transactional

01:29:45 analytic processing called F1.

01:29:47 And after leaving Google, I joined Cloudera and started a project there called Apache Impala,

01:29:55 or later it became an Apache project, a scale-out SQL engine.

01:29:58 And as part of that, I also co-created the Parquet file format because there was no good open source columnar format at the time.

01:30:07 So this was basically the motivation for creating Parquet as basically an open source implementation of Column.io from Google's Dremel.

01:30:17 And so, yeah, so that's really my background.

01:30:19 And I got interested in the, I want to say the ML space and computer vision by coincidence when I was an EIR at a venture firm and met a computer vision person there.

01:30:30 So this was early 22.

01:30:33 And I ended up then talking to a whole bunch of computer vision engineers and managers of engineering teams.

01:30:40 And they're obviously all used Python.

01:30:42 And all of that happens in the Python ecosystem.

01:30:45 And that was kind of the motivation for creating Pixeltable.

01:30:49 back then as a, I want to say, a unified view of data and storage and also the transformations

01:30:59 that you need when you're doing computer vision work, in particular model data set curation,

01:31:06 training, and so forth. So that's kind of the origin story of Pixeltable. And I'm happy to

01:31:13 talk about what it is and how it has evolved since then.

01:31:17 Yeah, that's very cool about all the database work that you've done in Parquet.

01:31:21 And it's really clear how that's led you to Pixeltable.

01:31:25 Yeah, it's I think, you know, people are still saving a bunch of CSVs and having huge disk usage because of all that and slow parsing.

01:31:34 And they should look at Parquet, right?

01:31:36 Yeah, I mean, Parquet has, you know, obviously since then become a real industry standard and is also the core of, you know, other industry standards such as Iceberg, etc.

01:31:45 So it's gratifying to see that that had an industry impact.

01:31:50 Yeah, I'm sure that it was.

01:31:51 All right.

01:31:51 So you talked about Pixeltable sort of being inspired by both database work and talking to all these vision folks.

01:31:58 What exactly is it?

01:31:59 It's like a database for images?

01:32:01 Well, it's not.

01:32:02 The name is a little misleading.

01:32:04 It is, at its core, it's an OLTP database system.

01:32:08 So think Postgres, basically something that you can use to power a web app.

01:32:12 You can do single row inserts and so forth.

01:32:15 It is multi-user.

01:32:16 It is transactional.

01:32:17 But now it's specifically meant for multimodal AI or multimodal applications and AI applications in particular.

01:32:26 And so it has additional column types that represent multimodal data.

01:32:30 So you have column types, document, video, audio, image, and array.

01:32:36 Oh, that's cool.

01:32:37 Basically, you can now create a table and put videos in there.

01:32:42 And you would put the videos in there as URLs.

01:32:45 And then Pixeltable would know when you need to do something with a video to actually transparently download it and so forth.

01:32:52 So that's one thing.

01:32:54 And then another specific aspect is that you can now also have computed columns in your table.

01:33:00 And so in the AI world, when you're working with multimodal data, a lot of the work is really transformations and kind of data plumbing.

01:33:09 And so this is kind of what you now can express inside the data model.

01:33:14 And the pixel table runtime will then basically, you can basically express a complex workflow as a number of computed columns.

01:33:23 It's basically a graph, a computational graph.

01:33:25 And it could be things like, let's say, take a video.

01:33:28 You have a table with videos.

01:33:29 You want to get transcripts.

01:33:32 First, you need to extract the audio.

01:33:33 So the audio extraction would be a computed column with the output type audio.

01:33:38 And it uses a UDF called Extract Audio, which uses FFmpeg or libAV.

01:33:45 So we're using standard Python ecosystem functionality for processing this data.

01:33:51 But you can now string it together without having to think about all the data plumbing.

01:33:56 And the whole thing is still transactional.

01:33:58 So you extract the audio as a computed column.

01:34:01 And then you could invoke a transcription model to get the transcript out as an example, which would be another computed column.

01:34:07 So you can build up these very complex graphs.

01:34:10 And then you must have some kind of backend that holds the audio and holds the transcript and then links to it or kind of like you talked about in the video.

01:34:16 So what we do is all basically media data is external and file based.

01:34:22 So if you show up with, you know, let's say a petabyte of videos, we're not expecting you to upload that.

01:34:27 That would be extremely inefficient.

01:34:29 So we're simply referencing it.

01:34:31 The same is true for audio files.

01:34:33 The same is true for computed columns.

01:34:34 You can actually even put a destination attribute on a computed column and tell it to put the media data into a bucket, as an example.

01:34:43 But then we put the structured data into Postgres, and we also use pgvector, but we have our own transaction system and our own type system on top of it.

01:34:54 And so when you do an insert into a pixel table table, it figures out the complete plan, like what goes into Postgres, what needs to happen in Python directly.

01:35:04 We have a runtime system and an execution system that is now able to take this computational graph and optimize it and do parallelization and asynchronous execution and stuff like that.

01:35:16 This is incredibly interesting. The more you're telling me about it, I think this is a really clever idea. The way it kind of turns the database into this workflow engine that just hides all the messiness of, oh, I just need the audio of this thing. So we got the audio column based on, you know, like computer column. And that's, I can see that just saving so much work, so much queue, asynchronous programming and all kinds of things.

01:35:40 Yeah, exactly. And that's really the motivation and the application area, which is basically multimodal data processing, right? Including very complex systems that are fairly easy to now model as basically a sequence of tables and views. So, you know, for a podcaster like you, this might actually be very advantageous.

01:36:00 Yeah, absolutely. I do transcripts and get the audio from videos and all sorts of stuff all the time. It certainly connects with me. So two things I want to talk about on your offering here, kind of the business model and just some features before we move on to startup row. One, let's go with a smaller one first.

01:36:17 I see that you have skills for working with Pixeltable and that you can install the skills with npm or other ways.

01:36:26 You also have startup with uv, like a startup template type thing.

01:36:30 I think there's some really interesting trends for people, including skills and other AI enablers or accelerators in their documentation or in their projects.

01:36:42 That's really neat.

01:36:43 the fact that you can just get the skill so your agent can just jump right in. What's your thinking

01:36:48 there? Yeah, exactly. I mean, this is sort of the way people are writing applications today,

01:36:53 right? You kind of expect to use AI to generate most of the code. And so, you know, Pixeltable

01:36:59 comes with a skill. And we actually think that Pixeltable is very agent-friendly and very AI

01:37:05 coding-friendly because it gives you a semantic model that basically gives you type safety and

01:37:12 also type safety of your data at rest. So it gives you a lot of guardrails and it allows you to

01:37:19 express these complex workflows with relatively little code. So there is less room for the AI

01:37:26 to drive the thing off a cliff, right? You're not going to, even with a very complex pipeline or a

01:37:30 very complex workflow, it's typically no more than a few hundred lines of code because all of the

01:37:36 data plumbers abstracted through the data model.

01:37:39 And so you don't end up, you end up with something that is actually maintainable

01:37:44 and where the AI has a far higher success rate of putting something together that actually works.

01:37:50 Then if you have to wire it all up from a bunch of components and you end up with like 10,000 lines of code

01:37:56 that if you go back to it a month later and try to change something, it's gonna quickly get out of hand, et cetera.

01:38:02 So this is really, we really see this as enabling technology for creating these applications

01:38:09 quickly and efficiently and also safely with AI.

01:38:13 I think it's a great idea.

01:38:15 And so many people talk about AI hallucinations and AI just creating a bunch of slop and junk.

01:38:21 And I feel a lot of that is because it's not guided and constrained.

01:38:25 You talked about the guardrails.

01:38:26 And instead of just saying, well, it's like Postgres, just so make me a database.

01:38:30 You know what I mean?

01:38:30 And it doesn't know about all these really cool features.

01:38:32 the skill comes on, it's like, oh, yeah, you need transcripts, you need audio.

01:38:36 It really can get a much better outcome.

01:38:39 Okay, so the other business thing before we jump into Startup Row is I see two things in your nav.

01:38:45 I see open source and I see pricing.

01:38:48 What's the business model and what's open source and what's pricing?

01:38:50 Yeah, yeah.

01:38:51 How does this work?

01:38:51 So Pixeltable is available as a locally pip installable package, and that is fully open source.

01:38:56 You can just, like I said, pip install it, run it locally.

01:38:59 And we are working on a cloud service where-

01:39:04 so like I said, Pixeltable is a database at its core.

01:39:07 When you install it locally, all the tables are local.

01:39:10 There's also a piece of serving infrastructure.

01:39:14 So you can very quickly, from a table definition, actually come up, bring up a REST endpoint,

01:39:20 and basically serve the logic and create basically CRUD applications with it.

01:39:25 But that's all on your local machine.

01:39:27 Or if you want to run it in the cloud, You have to host it yourself.

01:39:30 And we're soon going to be able to offer a basically cloud hosted tables, basically a cloud service that allow you to do all of that in the cloud, kind of like you think of an RDS or, you know, a snowflake or something like that.

01:39:42 But now for multimodal AI applications.

01:39:46 So that's amazing.

01:39:47 Yeah, exactly.

01:39:48 I think that's a really good solid model.

01:39:51 You know, like a lot of people use these tools, but then they're just not DevOps folks or they don't want to run servers.

01:39:58 You know, they just like, you guys just handle that for us, right?

01:40:00 And that's a perfect synergy without misaligned incentives and stuff.

01:40:05 Let's talk Startup Row.

01:40:06 Why do you guys, why do you apply to it?

01:40:08 Why do you go to this?

01:40:10 Yeah, obviously, you know, PyCon, a very large conference, draws a, you know, a technical and very interested audience.

01:40:19 And so, and we're obviously part of the Python ecosystem, right?

01:40:22 It was sort of Pixeltable is fully in Python.

01:40:26 And so this was a great opportunity for us to basically, you know, meet a bunch of folks.

01:40:33 And we had a very large number of interesting conversations where people basically came by and said, hey, what is Pixeltable?

01:40:39 And then very often the response was, oh, this is really interesting.

01:40:44 I know either I am working on something related or I know a colleague of mine who was just trying to solve a similar problem, etc.

01:40:50 So there were a lot of really interesting conversations that started that way.

01:40:54 Yeah, that's great.

01:40:56 I, you always hear that people say, oh, you should talk to customers as you're getting

01:40:59 your company up and running and people who are using your product and so on.

01:41:02 And it's one thing to put up a website.

01:41:04 It's another to get people engaged enough to spend some time talking with you.

01:41:08 And especially having that first experience, you know, people who look at your site and

01:41:11 don't make sense of it, just leave.

01:41:12 They don't take the time to talk to you.

01:41:14 But at the booth, people come by and they, they maybe have no idea and you can get these

01:41:19 first impressions.

01:41:19 And so did you learn some interesting stuff talking to people at the show?

01:41:24 Yeah, I mean, there's actually a large number of people who are building applications like this and sort of struggling to put it together.

01:41:32 So there's a fair amount of interest in multimodal AI.

01:41:37 And then also, as I mentioned, everybody's using AI for coding today.

01:41:43 So being AI-friendly, coding agent-friendly is also very important.

01:41:50 Yeah, of course.

01:41:51 So how'd StartupRow go?

01:41:52 You happy you went there?

01:41:54 it's worth your time and yeah yeah absolutely absolutely we had a good uh two days i was there

01:41:59 for two days and so yeah we had a lot of uh good conversations and not all of them i've had the

01:42:05 chance to follow up on yet but uh we will we will do that as well so yeah we're we're hoping we're

01:42:11 hoping it'll um it'll it'll help our business yeah i have a bunch of partnership follow-ups and other

01:42:18 things i want to reach out to people i met at pycon a couple weeks ago and i'm still just digging

01:42:23 backlog of email and work that I got from being away for a week. So it takes some time.

01:42:29 It does.

01:42:29 It does. So tell us about the tech behind this. You mentioned Postgres already. So I guess it's

01:42:36 built kind of on top of Postgres, but tell us about some of the tech that makes Pixeltable go.

01:42:41 Yeah, exactly. So Postgres is a component of the underlying stack. Like I said, we use Postgres

01:42:47 for structured storage and for transactional updates to structured storage or structured data.

01:42:54 And then obviously, because we're dealing with a lot of media data, we also need to maintain files.

01:43:00 There is an execution engine, which basically given a query or an update, like an insert, insert, update, delete, creates a plan, an execution plan.

01:43:09 So it looks very much like a database system on the inside.

01:43:12 So there is a catalog that records the metadata persistently.

01:43:16 We also put that into Postgres and then an execution system that creates a plan for any, like I said, query update statement and then runs through the plan.

01:43:28 And part of it is, like I said, our own asynchronous execution engine.

01:43:34 And then we have a whole bunch of integrations with external API providers like Anthropic, OpenAI, and then also integrations with things like the PIL package.

01:43:44 So, like I said, image is a column type and you can now run all the PIL image transformations on your image data simply via basically putting that into a query or a computed column.

01:43:58 Right. So you don't have to. And again, the output then is stored back in the right place.

01:44:02 So you don't have to think about intermediate data and where it all lands or how you find it again and so forth.

01:44:08 So we're really tying together an execution system with the storage system and doing it all transactionally.

01:44:15 So it's a full multi-user database and with transactional, you know, isolation, atomicity, semantics and so forth.

01:44:22 Sounds awesome. So where's things going? How long has this been around to start, I guess?

01:44:27 When did you make it public?

01:44:29 We incorporated in April of 24.

01:44:32 Yeah.

01:44:32 So a little over two years.

01:44:34 And we're about to launch the, like I said, service for these cloud hosted tables.

01:44:41 So basically taking a local pixel table.

01:44:44 And then we're also right from the very beginning, it's been very important for us to give you

01:44:48 basically a hybrid experience.

01:44:50 And so it was always the plan that the local SDK would be there.

01:44:54 and there would be a cloud service to complement it.

01:44:57 It was never the idea that there would only be a cloud service, right?

01:45:01 So a lot of, especially data scientists like to work locally, but we also see that as an advantage now with agentic coding

01:45:09 that agents AI likes to iterate quickly and doing that locally is quick by definition, right?

01:45:14 You avoid server round trips and all that stuff.

01:45:16 So we're really happy that that's how the setup works.

01:45:22 So, but yeah, so the next step, like I said, in the next few months, we'll have the cloud service up and running.

01:45:31 Oh, excellent.

01:45:32 If somebody is using Postgres today and they think, oh, some of these extra features, like a document column type or image column type or whatever, sounds really great.

01:45:41 What's the process from going from like a regular Postgres setup to yours?

01:45:47 Like, could we use SQLAlchemy to talk to Pixeltable, or do we need a special driver to speak to it?

01:45:53 Yeah, so Pixeltable does not speak SQL.

01:45:56 So, and Pixeltable is also not a, it's a separate database system.

01:46:00 Really, we use Postgres under the covers, but it's not a Postgres layer or anything like that or a plugin.

01:46:06 So you would need to ingest your data from Postgres into Pixeltable.

01:46:09 Right. Kind of a whole two data models in memory at once for a moment and do a migration and then

01:46:16 then just swap or something like that. Yeah, exactly. And then, like I said,

01:46:20 it's not a, we don't have SQL support. It's not meant for analytic applications really. And so

01:46:27 you would be using the pixel table SDK for, you know, to both express the table structure and then

01:46:33 also create your, like I said, there is a very simple way of creating services given table

01:46:39 definitions. Yeah, excellent. Okay, so let's close out this segment by having you just speak to people

01:46:44 who this sounds interesting to. What should they do to get started? Yeah, I mean, my advice is

01:46:51 look in the docs, go to the GitHub page, look through the readme, have install it. We have a,

01:46:56 you know, a quick start guide, and then there's a whole bunch of tutorials. And one of them is

01:47:03 probably going to reflect something that you're interested in, such as, let's say, creating an

01:47:09 index of audio transcripts of your videos, as an example, right? That's something you can actually

01:47:14 do in probably half an hour. And then play around with it. Okay, excellent. Marcel, thank you for

01:47:20 being on the show. And Pixeltable looks like a really cool product. Congratulations. Michael,

01:47:25 thank you for having me again. You bet. Bye-bye. This has been another episode of Talk Python to

01:47:30 Thank you to our sponsors.

01:47:31 Be sure to check out what they're offering.

01:47:33 It really helps support the show.

01:47:35 What if your AI agents worked like FastAPI microservices, typed, autonomous, and discovering each other at runtime?

01:47:41 That's the world AgentField is building.

01:47:44 Join them at talkpython.fm/agentfield.

01:47:47 If you or your team needs to learn Python, we have over 270 hours of beginner and advanced courses

01:47:53 on topics ranging from complete beginners to async code, Flask, Django, HTMX, and even LLMs.

01:47:59 Best of all, there's no subscription in sight.

01:48:02 Browse the catalog at talkpython.fm.

01:48:05 And if you're not already subscribed to the show on your favorite podcast player,

01:48:09 what are you waiting for?

01:48:10 Just search for Python in your podcast player.

01:48:12 We should be right at the top.

01:48:13 If you enjoyed that geeky rap song, you can download the full track.

01:48:16 The link is actually in your podcast blur show notes.

01:48:19 This is your host, Michael Kennedy.

01:48:21 Thank you so much for listening.

01:48:22 I really appreciate it.

01:48:23 I'll see you next time.

01:48:36 I'm out.