Introduction to Pandas¶

Pandas is a Python library for working with datasets. It provides functions for analyzing, cleaning, exploring, and manipulating data, and it is designed to be both fast and productive to work with.

Why use Pandas?¶

  1. Pandas allows us to analyze large datasets and draw conclusions based on statistical theory.
  2. Pandas can clean messy datasets and make them readable.
  3. Pandas makes it easy to handle missing data (represented as NaN) in both floating-point and non-floating-point data.
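As a rough illustration of the cleaning and missing-data points above, the sketch below counts, fills, and drops missing values. The column names and values are hypothetical sample data, not taken from any real dataset, and filling with the column mean is only one of several reasonable strategies.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: a numeric column and a text column, each with one gap
df = pd.DataFrame({
    "temperature": [21.5, np.nan, 19.0],
    "city": ["Oslo", None, "Lima"],
})

# Count the missing values in each column
missing_per_column = df.isna().sum()

# Fill the gaps: the column mean for numbers, a placeholder label for text
filled = df.fillna({"temperature": df["temperature"].mean(), "city": "unknown"})

# Alternatively, drop every row that contains a missing value
dropped = df.dropna()

print(missing_per_column)
print(filled)
print(dropped)
```

Whether to fill or drop depends on the analysis; dropping is simpler but discards the whole row, including any valid values it holds.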

Pandas works with the following two data structures:¶

  1. Series
  2. DataFrame
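A minimal sketch of the two structures listed above: a Series is a one-dimensional labeled array, and a DataFrame is a two-dimensional table whose columns are each a Series. The names and values used here are made up for illustration.

```python
import pandas as pd

# A Series: one-dimensional values with an index of labels
ages = pd.Series([25, 32, 47], index=["ann", "bob", "cy"], name="age")

# A DataFrame: a table of columns that share the same index
people = pd.DataFrame(
    {"age": [25, 32, 47], "city": ["Oslo", "Lima", "Pune"]},
    index=["ann", "bob", "cy"],
)

print(ages["bob"])                 # look up a Series value by label
print(people.loc["bob", "city"])   # look up a DataFrame cell by row and column label
print(type(people["age"]))         # selecting one column yields a Series
```

Both structures are indexed by label, so values can be looked up by name rather than by position.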

Pandas Installation¶

In [ ]:
!pip install pandas

Import Pandas¶

In [1]:
import pandas

Pandas is usually imported under the pd alias¶

In [2]:
import pandas as pd

Check Python and Pandas version¶

In [3]:
import platform
import pandas as pd

# Report the interpreter and library versions in use
print(f'Python version: {platform.python_version()}')
print(f'Pandas version: {pd.__version__}')
Python version: 3.10.4
Pandas version: 1.4.2