pandas.DataFrame.duplicated¶
Return boolean Series denoting duplicate rows, optionally only considering certain columns.
- param subset
- column label or sequence of labels, optional
Only consider certain columns for identifying duplicates, by default use all of the columns
- param keep
- {‘first’, ‘last’, False}, default ‘first’
first
Mark duplicates asTrue
except for thefirst occurrence.
last
Mark duplicates asTrue
except for thelast occurrence.
False : Mark all duplicates as
True
.
- return
Series
Warning
This feature is currently unsupported by Intel Scalable Dataframe Compiler