Glossary
Programming Fundamentals
Variable โ A named location used to store data in memory.
Function โ A reusable block of code designed to perform a specific task.
Loop โ A control flow structure that repeats a block of code (for, while).
Conditional Statement โ A logical structure used for decision-making in code (if, else).
Boolean โ A data type with two values: True or False.
Data Structures
List โ A mutable, ordered collection of items in Python.
Tuple โ An immutable, ordered collection of items.
Dictionary โ An unordered, mutable collection of key-value pairs.
Set โ An unordered collection of unique items.
Array โ A sequence of elements stored in a contiguous memory location (used in NumPy).
Data Analysis
DataFrame โ A two-dimensional, tabular data structure in pandas with labeled axes (rows and columns).
Series โ A one-dimensional array-like object in pandas.
Index โ Labels for accessing rows or columns in a pandas object.
Missing Data โ Data entries with null or undefined values, often represented by NaN.
File Formats
CSV (Comma-Separated Values) โ A simple text format for tabular data.
Excel (.xlsx) โ A spreadsheet file format used widely for data storage.
JSON (JavaScript Object Notation) โ A structured, human-readable format used for data interchange.
Pickle โ A Python-specific format used for serializing and saving Python objects.
Data Cleaning
Null Values โ Empty or missing data entries in a dataset.
Outliers โ Data points that differ significantly from other observations.
Data Type Conversion โ Changing a columnโs data type (e.g., string to integer).
Normalization โ Scaling numeric data to a common range.
Visualization
Matplotlib โ A low-level library for creating static plots.
Seaborn โ A high-level interface for drawing attractive statistical graphics.
Plotly โ A library for interactive visualizations.
Bar Chart โ A chart that uses bars to show frequencies or counts.
Scatter Plot โ A graph that uses dots to show values for two variables.