Python: counting exactly matching columns using index labels

SG Kwon • 3 years ago • 1540 views

I have a DataFrame of 0s and 1s:

a   1 1 1 1 0 0 0 1 0 0 0 0 0
b   1 1 1 1 0 0 0 1 1 0 0 0 0
c   1 1 1 1 0 0 0 1 1 1 1 0 0
d   1 1 1 1 0 0 0 1 1 1 1 0 0
e   1 1 1 1 0 0 0 0 0 0 0 1 1
f   1 1 1 1 1 1 1 0 0 0 0 0 0

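I want to write a function that takes a list of strings (row names) and returns the number of columns that exactly match those rows.

For example: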

def exact_match(ls1):
  ~~~~~
  return col_num

print(exact_match(['c', 'd']))
>>> 2

The output is 2 because only two columns match exactly.
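One way to read "exact match" that is consistent with the expected output of 2 is: a column counts when the rows holding a 1 in that column are exactly the rows in the given list. That reading is an assumption, not something the question states outright. A minimal sketch under that assumption, with a hypothetical helper `rows_with_ones` and the frame rebuilt from the table above:

```python
import pandas as pd

# The table from the question, keyed by row name (columns become 0..12).
rows = {
    "a": [1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0],
    "b": [1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0],
    "c": [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
    "d": [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
    "e": [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1],
    "f": [1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
}
df = pd.DataFrame.from_dict(rows, orient="index")


def rows_with_ones(frame):
    """Hypothetical helper: map each column to the set of row names holding a 1."""
    return {col: set(frame.index[frame[col] == 1]) for col in frame.columns}


def exact_match(ls1, frame=df):
    """Count columns whose 1s sit in exactly the rows named in ls1 (assumed reading)."""
    target = set(ls1)
    return sum(1 for members in rows_with_ones(frame).values() if members == target)


print(exact_match(["c", "d"]))  # 2
```

Under this reading the two matching columns are the tenth and eleventh, where only rows c and d hold a 1.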

Permalink: http://www.python88.com/topic/133033

2 replies | latest reply 3 years ago

• Reply 1

MoRe 3 years ago

If I understand you correctly,

your DataFrame looks like this:

import pandas as pd

df = pd.DataFrame(data = [
    ["a", 1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0],
    ["b", 1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0],
    ["c", 1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
    ["d", 1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
    ["e", 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1],
    ["f", 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
])
df = df.rename(columns = {0:"name"}).set_index("name")

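Then:

def exact_match(lst):
    # columns where every selected row holds a 1
    selected = df.loc[lst].sum(axis = 0) == len(lst)
    # of those, keep columns whose sum over all rows is also len(lst),
    # i.e. no other row holds a 1 (assumes the data is strictly 0/1)
    s = df.loc[:, selected].sum(axis = 0) == len(lst)
    return int(s.sum())
exact_match(["c","d"]) # output: 2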
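To see why comparing sums against `len(lst)` works, the two comparisons can be evaluated separately. The sketch below is an illustrative breakdown, not the reply author's own code; it rebuilds the frame with an explicit index and assumes the values are only 0 or 1, since the sum trick breaks for any other values.

```python
import pandas as pd

df = pd.DataFrame(
    [
        [1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1],
        [1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    ],
    index=list("abcdef"),
)

lst = ["c", "d"]

# Step 1: the sum over the selected rows equals len(lst)
# only when every selected row holds a 1 in that column.
all_selected = df.loc[lst].sum(axis=0) == len(lst)

# Step 2: the sum over ALL rows equals len(lst) as well
# only when no other row contributes a 1 to that column.
exact = all_selected & (df.sum(axis=0) == len(lst))

print(df.columns[all_selected].tolist())  # [0, 1, 2, 3, 7, 8, 9, 10]
print(df.columns[exact].tolist())         # [9, 10]
print(int(exact.sum()))                   # 2
```

Columns 0 to 3, 7 and 8 pass the first step but fail the second, because rows other than c and d also hold a 1 there.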

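• Reply 2

mozway 3 years ago

The question is unclear, but if you want the columns that have 1s in the provided index rows and no 1s in the other rows, you can use: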

def exact_match(ls1):
    # 1s on the provided indices
    m1 = df.loc[ls1].eq(1).all()
    # no 1s in the other rows
    m2 = df.drop(ls1).ne(1).all()
    # slice and get shape
    return df.loc[:, m1&m2].shape[1]
    # or
    # return (m1&m2).sum()

print(exact_match(['c', 'd']))
# 2
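As a sanity check, the sum-based and mask-based approaches from the two replies can be compared over every non-empty subset of row names. The sketch below is an assumption-laden restatement rather than either author's code: the function names `exact_match_sum` and `exact_match_mask` are hypothetical, both take the frame as a parameter instead of using a global `df`, and the data is assumed to be strictly 0/1.

```python
from itertools import combinations

import pandas as pd

df = pd.DataFrame(
    [
        [1, 1, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0],
        [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1],
        [1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    ],
    index=list("abcdef"),
)


def exact_match_sum(frame, labels):
    """Sum-based count (only valid for 0/1 data)."""
    labels = list(labels)  # a tuple passed to .loc would mean row/column indexing
    selected = frame.loc[labels].sum(axis=0) == len(labels)
    total = frame.sum(axis=0) == len(labels)
    return int((selected & total).sum())


def exact_match_mask(frame, labels):
    """Mask-based count: 1s in all given rows, no 1s in the remaining rows."""
    labels = list(labels)
    m1 = frame.loc[labels].eq(1).all()
    m2 = frame.drop(labels).ne(1).all()
    return int((m1 & m2).sum())


# Collect every subset of row names on which the two functions disagree.
mismatches = [
    combo
    for size in range(1, len(df.index) + 1)
    for combo in combinations(df.index, size)
    if exact_match_sum(df, combo) != exact_match_mask(df, combo)
]

print(mismatches)                        # []
print(exact_match_mask(df, ["c", "d"]))  # 2
```

On this data the two agree everywhere. The mask-based version is the safer choice if the frame could ever contain values other than 0 and 1, since it tests for 1 explicitly instead of relying on sums.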
