R: Remove duplicate rows in a data frame – Snippet #2

Discover how to remove duplicate rows in a data frame with R

15 Apr 2022 — 1 min read

Packages

This snippet requires dplyr.

With the Tidyverse:

library(tidyverse)

Without the Tidyverse:

library(dplyr)

To remove duplicate rows in a data frame, use distinct(). Duplicate rows are rows that are perfectly identical.

With the pipe operator:

new_df <- df %>%
  distinct()

Without the pipe operator:

new_df <- distinct(df)

The code above removes all perfectly identical rows in df.

Bluesky is becoming an increasingly appealing social media platform for a wide variety of economists

I am going back to academia

An illustration of the limits of aggregating data

Discover the features of version 1.86, a major update of the platform