alex_bn_lee

导航

< 2025年3月 >
23 24 25 26 27 28 1
2 3 4 5 6 7 8
9 10 11 12 13 14 15
16 17 18 19 20 21 22
23 24 25 26 27 28 29
30 31 1 2 3 4 5

统计

[1068] Find records with duplicate values in the specific column

To find records with duplicate values in the column “A” of a Pandas DataFrame, you can use the duplicated() method. Here’s how you can do it:

Example

import pandas as pd
# Sample DataFrame
df = pd.DataFrame({
"A": [1, 2, 2, 3, 4, 4, 4],
"B": [5, 6, 7, 8, 9, 10, 11]
})
# Find duplicate records based on column "A"
duplicates = df[df.duplicated(subset=["A"], keep=False)]
print(duplicates)

Output

A B
1 2 6
2 2 7
4 4 9
5 4 10
6 4 11

Explanation

  • subset=["A"]: Specifies the column to check for duplicates.
  • keep=False: Ensures all duplicates are marked, not just the first occurrence.

This will give you all the rows where the values in column “A” are duplicated. If you have any specific requirements or need further assistance, feel free to ask!

 

posted on   McDelfino  阅读(11)  评论(0编辑  收藏  举报

相关博文:
阅读排行:
· DeepSeek 开源周回顾「GitHub 热点速览」
· 记一次.NET内存居高不下排查解决与启示
· 物流快递公司核心技术能力-地址解析分单基础技术分享
· .NET 10首个预览版发布:重大改进与新特性概览!
· .NET10 - 预览版1新功能体验(一)
历史上的今天:
2023-10-09 [894] Optimize arcpy scripts
2023-10-09 [893] Add comments at a batch file (CMD)
2022-10-09 【747】多分类模型metrics计算
点击右上角即可分享
微信分享提示