Douglass Read Cutting is an American software designer known for building widely used open-source search and big-data software. Before his big-data work he created Lucene, a text search library, and co-founded the Nutch web crawler project with Mike Cafarella. Both projects became part of the Apache Software Foundation.
When Cutting and Cafarella needed a way to run Nutch’s indexing across many machines, they drew on two papers Google published describing its internal infrastructure: the Google File System (GFS) and MapReduce. Reimplementing those ideas as open source produced a distributed storage and processing layer that grew into a separate project named Hadoop. Cutting later led the Hadoop project full-time at Yahoo, where it was scaled to very large clusters.
Cutting was deeply involved in the governance of the Apache Software Foundation, the home of Lucene, Nutch, and Hadoop. The Foundation’s own board meeting records from May 2010 list “Doug Cutting” among the Directors present, documenting his role as an ASF director during that period.
His career illustrates a recurring pattern in this history: ideas first described in research papers from a single company become broadly available only once someone reimplements them as open-source software that anyone can run. Hadoop did that for distributed batch processing and helped launch what became known as the big-data era.