A Gentle Introduction to PyPy — Faster Python With Minimal Changes

Developers
master
November 28, 2022

A Gentle Introduction to PyPy — Faster Python With Minimal Changes | by Pavel Durov

PyPy vs. Python — performance and benchmarking

In this article, I will cover my experience with the PyPy that I was only recently exposed to.

This article complements the Writing an Interpreter with PyPy tutorial from 2011 [1]. When I first tried to follow the steps of this blog post, I encountered many issues, such as out-of-date documentation, out-of-date code references, python version incompatibility, etc. I will try to cover the gotchas and my learning experience here.

Writing an Interpreter with PyPy is about creating a BF [2] interpreter and the translation process of it with the PyPy toolchain. When I followed along, the main issues for me were related to the toolchain itself. Hence I decided to centre my attention on it here. We will not cover the BF interpreter here since it’s all explained in the original blog post, but I will go through the PyPy toolchain and some basic benchmarking concepts essential for understanding the topic.

If you tried Python before, you were probably running CPython [3]. CPython is the most common implementation of Python VM (Virtual Machine). PyPy [4] is an alternative to CPython.

With PyPy, we write our programs in RPython [5], and apply the RPython translation toolchain that generates binary executable. One major advantage of PyPy over CPython is speed which will be demonstrated later in this article using the basic benchmarking tool.

Jargon summary:

CPython — The most common Python implementation
PyPy — CPython alternative
RPython — restricted version of Python
Benchmark — the act of assessing program relative performance (time in our case)

First, we need to write a simple RPython program, call it python_prime.py: