Tag

outliers

0 views collected around this technical thread.

Test Development Learning Exchange
Test Development Learning Exchange
Oct 28, 2024 · Big Data

Data Preprocessing with Pandas: A Comprehensive Guide

This article provides a comprehensive guide to data preprocessing using Pandas, covering essential steps like data cleaning, feature engineering, and data transformation for machine learning projects.

Categorical EncodingDataset Splittingdata cleaning
0 likes · 5 min read
Data Preprocessing with Pandas: A Comprehensive Guide
Model Perspective
Model Perspective
Sep 16, 2024 · Fundamentals

Why Identical Statistics Can Hide Very Different Data: The Lesson of Anscombe’s Quartet

Anscombe’s Quartet shows that four data sets can share identical means, variances, regression lines and correlation coefficients yet display completely different scatter‑plot shapes, highlighting why visualisation is crucial and why relying only on summary statistics can mislead analysts.

Anscombe's QuartetData Visualizationoutliers
0 likes · 6 min read
Why Identical Statistics Can Hide Very Different Data: The Lesson of Anscombe’s Quartet
Architects Research Society
Architects Research Society
Nov 21, 2016 · Artificial Intelligence

Data Science Q&A: Overfitting, Experimental Design, Tall/Wide Data, Chart Junk, Outliers, Extreme Value Theory, Recommendation Engines, and Visualization

This article presents a series of data‑science questions and expert answers covering overfitting, experimental design for user behavior, the distinction between tall and wide data, detecting chart junk, outlier detection methods, extreme‑value theory for rare events, recommendation‑engine fundamentals, and techniques for visualizing high‑dimensional data.

Experimental designRecommendation systemschart junk
0 likes · 18 min read
Data Science Q&A: Overfitting, Experimental Design, Tall/Wide Data, Chart Junk, Outliers, Extreme Value Theory, Recommendation Engines, and Visualization