NumPy documented in Nature

In September 2020, the journal Nature published “Array Programming with NumPy” (Charles R. Harris, Stefan van der Walt, and 24 co-authors), the first formal scientific description of the NumPy library written by its core developers. NumPy itself dates back roughly two decades earlier, but this paper marked the open-source project’s documentation in a top peer-reviewed venue.

NumPy provides the N-dimensional array - a single, fast, uniform way to represent and operate on numerical data in Python. The paper describes this as “a simple and powerful programming paradigm for organizing, exploring, and analyzing scientific data” and calls the library “the foundation upon which the entire scientific Python universe is constructed.” It also notes NumPy’s role in landmark results such as the gravitational-wave detection and the first black-hole image.

This matters to the AI story because NumPy is the bedrock layer of the deep learning stack. The frameworks above it - TensorFlow, PyTorch, and the wider scientific Python ecosystem - inherit NumPy’s array model and its conventions. When practitioners say AI runs on Python, the array abstraction they are leaning on is, in large part, the one NumPy standardized.

Sources

Last verified June 6, 2026