A Series of Short Articles on Data Processing

Ofili Lewis
2 min readJan 26, 2023
Photo by Campaign Creators on Unsplash

Data processing is a critical aspect of modern data management, and is essential for organizations of all sizes and industries. The series of short articles on data processing aims to provide an in-depth understanding of the different techniques and tools used for data processing, such as data cleaning, preprocessing, visualization, normalization, transformation, reduction, integration, validation, and quality control.

The series will start with an overview of data processing and its importance, and then delve into each individual technique in detail. We will cover various data processing techniques such as data cleaning, which is used to remove missing, duplicate, or irrelevant data, and data preprocessing, which is used to prepare data for analysis and modeling. We will also cover data visualization, which is used to understand and communicate data insights through graphical representations, data normalization, which is used to ensure that data is in a consistent format, and data transformation, which is used to convert data from one format or structure to another.

We will also discuss data reduction, which is used to reduce the size of data while maintaining its integrity, data integration, which is used to combine data from multiple sources into a single, unified dataset, and data validation, which is used to ensure that data is accurate, consistent, and complete. Finally, we will cover data quality control, which is used to ensure that data is of high quality, and can be used to make more informed decisions.

Throughout the series, we will provide examples in Python and SQL where possible, so that readers can apply the concepts and techniques to their own data processing projects. Additionally, we will cover various best practices and common pitfalls to avoid during data processing.

This series is intended for data scientists, analysts, and anyone interested in understanding the data processing process and how to use it to improve their data analysis and decision-making capabilities.

See the articles below.

--

--

Ofili Lewis

Transforming and making data more accessible so that organizations can use it to evaluate and optimize performance.