A Statistician’s R Notebook
  • About
  • BlogRoll
    • R-bloggers
    • R-Studio Community
    • R weekly
    • Allison Horst
    • Github R Topics
Categories
All (22)
adjusted-r-squared (1)
API (2)
apply (1)
categorical data (1)
Causal Inference (1)
Causation (1)
CBRT (1)
clean_names (1)
Correlation (1)
CSV (1)
Data Analysis (4)
Data Cleaning (2)
Data Export (1)
Data Import (1)
Data Preprocessing (2)
Data Science (3)
data transformation (1)
data types (6)
dataframe (1)
date handling (1)
do.call (1)
Econometrics (1)
EVDS (1)
Excel (2)
factor (1)
Functions (1)
grep (1)
grepl (1)
gsub (1)
Import (2)
Imputation (1)
Inter Quartile Range (1)
lapply (1)
lists (1)
lubridate (1)
Machine Learning (1)
map (1)
matrices (1)
Missing Data (1)
model-evaluation (1)
OECD (1)
openxlsx (1)
Outliers (1)
pivot_longer (1)
pivot_wider (1)
predictive-modeling (1)
Public Health (1)
R (4)
R Programming (16)
R Studio (1)
r-squared (1)
reduce (1)
regex (1)
Report Automation (1)
rsdmx (1)
sapply (1)
Standard Deviation (1)
Standard Error (1)
Statistics (5)
sub (1)
text analysis (1)
tibble (1)
tidymodels (1)
tidyr (1)
time manipulation (1)
time series (1)
vapply (1)
vectors (1)
Z-Score (1)

Understanding Data Import and Export in R: Working with CSV and Excel Files
R Programming
Data Analysis
Data Science
CSV
Excel
Data Import
Data Export
When learning R, most people focus on functions, models, and visualizations. However, many real-world problems start much earlier — at the data import stage — and end much…
M. Fatih Tüzen
Dec 26, 2025

Outliers in Data Analysis: Detecting Extreme Values Before Modeling in R with İstanbul Airbnb Data
R
Statistics
Data Analysis
Data Science
Data Preprocessing
Outliers
Inter Quartile Range
Z-Score
Data Cleaning
Data preprocessing is often presented as a sequence of technical steps. However, each preprocessing decision implicitly embeds a statistical assumption.
M. Fatih Tüzen
Dec 19, 2025

Handling Missing Data in R: A Comprehensive Guide
R
Statistics
Data Analysis
Data Science
Data Preprocessing
Missing Data
Data Cleaning
Imputation
Data preprocessing is a cornerstone of any data analysis or machine learning pipeline. Raw data rarely comes in a form ready for direct analysis — it often requires…
M. Fatih Tüzen
Aug 18, 2025

Standard Deviation vs. Standard Error: Meaning, Misuse, and the Math Behind the Confusion
R
Statistics
Data Analysis
Standard Deviation
Standard Error
In the world of data analysis and statistics, standard deviation (SD) and standard error (SE) are two concepts that are often misunderstood or—worse—used interchangeably.…
M. Fatih Tüzen
Jul 11, 2025

Correlation vs Causation: Understanding the Difference
Statistics
Correlation
Causation
Econometrics
Causal Inference
Public Health
“Correlation is not causation” – it’s a refrain we hear often, yet the distinction between these concepts is deceptively easy to overlook. Correlation refers to a…
M. Fatih Tüzen
Jun 4, 2025

Explained vs. Predictive Power: R², Adjusted R², and Beyond
R
Statistics
Machine Learning
r-squared
adjusted-r-squared
predictive-modeling
tidymodels
model-evaluation
You trust R². Should you?
You proudly present a model with R² = 0.95. Everyone applauds.
But what if your model fails miserably on the next new data?
M. Fatih Tüzen
Apr 30, 2025

Underrated Gems in R: Must-Know Functions You’re Probably Missing Out On
reduce
vapply
do.call
clean_names
R is packed with powerhouse tools—think dplyr for data wrangling, ggplot2 for stunning visuals, or tidyr for tidying up messes. But beyond the headliners, there’s a lineup…
M. Fatih Tüzen
Mar 11, 2025

Unlocking CBRT Data in R: A Guide to the CBRT R Package
R Programming
CBRT
EVDS
Import
API
The Central Bank of the Republic of Turkey (CBRT) provides a wealth of economic data crucial for researchers, analysts, and policymakers. Through the Electronic Data…
M. Fatih Tüzen
Dec 31, 2024

Extracting Data from OECD Databases in R: Using the oecd and rsdmx Packages
R Programming
OECD
rsdmx
Import
API
The OECD (Organisation for Economic Co-operation and Development) provides extensive databases for economic, social, and environmental indicators. Accessing these…
M. Fatih Tüzen
Dec 16, 2024

Creating Professional Excel Reports with R: A Comprehensive Guide to openxlsx Package
R Programming
Report Automation
openxlsx
Excel
The ability to generate professional Excel reports programmatically is a crucial skill in data analysis and business reporting. In this comprehensive guide, we’ll explore…
M. Fatih Tüzen
Nov 4, 2024

Artwork by: Allison Horst

Mastering Date and Time Data in R with lubridate
R Programming
lubridate
time series
time manipulation
date handling
lubridate is a powerful and widely-used package in the tidyverse ecosystem, specifically designed for making date-time manipulation in R both easier and more intuitive. It…
M. Fatih Tüzen
Sep 30, 2024

Artwork by: Shannon Pileggi and Allison Horst

Mastering Data Transformation in R with pivot_longer and pivot_wider
R Programming
tidyr
pivot_wider
pivot_longer
data transformation
Data analysis requires a deep understanding of how to structure data effectively. Often, datasets are not in the format most suitable for analysis or visualization. That’s…
M. Fatih Tüzen
Sep 19, 2024

Text Data Analysis in R: Understanding grep, grepl, sub and gsub
R Programming
grep
grepl
sub
gsub
regex
text analysis
In text data analysis, being able to search for patterns, validate their existence, and perform substitutions is crucial. R provides powerful base functions like grep, grepl, …
M. Fatih Tüzen
Jul 9, 2024

Exploring apply, sapply, lapply, and map Functions in R
R Programming
apply
sapply
lapply
map
In R programming, Apply functions (apply(), sapply(), lapply()) and the map() function from the purrr package are powerful tools for data manipulation and analysis. In this…
M. Fatih Tüzen
Apr 15, 2024

R Function Writing 101:A Journey Through Syntax, Best Practices, and More
R Programming
Functions
R is a powerful and versatile programming language widely used in data analysis, statistics, and visualization. One of the key features that make R so flexible is its…
M. Fatih Tüzen
Jan 23, 2024

https://allisonhorst.com/everything-else

Cracking the Code of Categorical Data: A Guide to Factors in R
R Programming
data types
factor
categorical data
R programming is a versatile language known for its powerful statistical and data manipulation capabilities. One often-overlooked feature that plays a crucial role in…
M. Fatih Tüzen
Jan 11, 2024

https://openscapes.org/blog/2020-10-12-tidy-data/

Unraveling DataFrames in R: A Comprehensive Guide
R Programming
data types
dataframe
tibble
In R, a data frame is a fundamental data structure used for storing data in a tabular format, similar to a spreadsheet or a database table. It’s a collection of vectors of…
M. Fatih Tüzen
Dec 29, 2023

Understanding Lists in R Programming
R Programming
data types
lists
R, a powerful statistical programming language, offers various data structures, and among them, lists stand out for their versatility and flexibility. Lists are collections…
M. Fatih Tüzen
Dec 19, 2023
No matching items
  • 1
  • 2

© 2023, M. Fatih Tüzen

 

This page is built with Quarto.