alex_bn_lee

导航

[1068] Find records with duplicate values in the specific column

To find records with duplicate values in the column “A” of a Pandas DataFrame, you can use the duplicated() method. Here’s how you can do it:

Example

import pandas as pd

# Sample DataFrame
df = pd.DataFrame({
    "A": [1, 2, 2, 3, 4, 4, 4],
    "B": [5, 6, 7, 8, 9, 10, 11]
})

# Find duplicate records based on column "A"
duplicates = df[df.duplicated(subset=["A"], keep=False)]
print(duplicates)

Output

   A   B
1  2   6
2  2   7
4  4   9
5  4  10
6  4  11

Explanation

  • subset=["A"]: Specifies the column to check for duplicates.
  • keep=False: Ensures all duplicates are marked, not just the first occurrence.

This will give you all the rows where the values in column “A” are duplicated. If you have any specific requirements or need further assistance, feel free to ask!

 

posted on 2024-10-09 07:58  McDelfino  阅读(10)  评论(0编辑  收藏  举报