Skip to content

Processing Large Data with Pandas

Short Talk at 9:58AM EDT

Data sets can get large quickly. You can quickly go from looking at: a few 100 lines and a handful of columns to a million lines and with hundreds of columns.
Python Pandas (with smart use of Categories) can enable one to reduce the size of ones data in memory by up to 90%. However, careless use can increase memory use. Learn when to use categories to save memory. Slides located here

Presented by

Evelyn J. Boettcher