代码之家 › 专栏 › 技术社区 › Adriana

尝试创建文件列表时出错

data-cleaning typeerror operating-system python

Adriana · 技术社区 · 1 年前

我有一个包含20个csv文件的文件夹。每个文件大约有10列和数千行。csv文件如下所示:

基因	p值	xyz
棘皮	0.05	123
基质金属蛋白酶2	0.02	456
mmp9	0.07	789
nnos	0.09	123
gfap	0.01	456

我编写了以下脚本,目的是浏览每个文件,并仅根据我指示的感兴趣的基因(在本例中为mmp2和mmp9)过滤行。

# the goal is to edit and save the csv files so they only contain the genes of interest

path = '/Users/adriana/Library/Documents/raw_data',
all_files = glob.glob(os.path.join(path, "*.csv")) #make list of file paths 
genes = ["mmp2", "mmp9"]
for file in all_files:
    path = '/Users/adriana/Library/Documents/raw_data'
    df = pd.read_csv(file,delimiter ='\t')
    cleaned = df[df['gene'].isin(genes)]
    cleaned.to_csv(file)

但是,我收到以下与创建对象“all_files”有关的错误:

TypeError:应为str、字节或os。PathLike对象,而不是元组

我以前无缝地使用过这个脚本,所以我不确定发生了什么。

1 回复 | 直到 1 年前

Aymen Azoui 1 年前

试试这个:

import os
import glob
import pandas as pd



path = '/Users/adriana/Library/Documents/raw_data'  # Removed comma here
all_files = glob.glob(os.path.join(path, "*.csv"))  # make list of file paths 
genes = ["mmp2", "mmp9"]
for file in all_files:
    df = pd.read_csv(file, delimiter='\t')
    cleaned = df[df['gene'].isin(genes)]  
    cleaned.to_csv(file, index=False)

推荐文章

Denis · 在C、linux中同步进程

1 年前

Depleted Money · 源代码显示的不同输出(机器学习)(Python)

1 年前

ridhomblr · 如果DI>32767,VGA输出不显示

1 年前

SoulSystem · (C#)如何编写适用于不同操作系统的不同代码?

1 年前

Alex S. · 如何在python中的PIL保存函数中指定要保存到哪个文件?

1 年前

Eric B · 当CPU/内核被硬件中断淹没时,它是如何允许用户空间代码运行的?

1 年前

dmgzh · 如何根据所使用的系统更改变量值?(Python)

1 年前

gitm_248 · Ubuntu安装和关闭的问题:寻求解决问题的指导

1 年前

Justin Villerot SleepyEyes · Debian上的Startx黑屏

1 年前

Adriana · 尝试创建文件列表时出错

1 年前