代码之家 › 专栏 › 技术社区 › dumbledad

大内存机器内存错误,小内存机器内存错误:相同代码,相同数据

pandas

dumbledad · 技术社区 · 6 年前

我在我的两台机器上运行以下操作:

import os, sqlite3
import pandas as pd
from feat_transform import filter_anevexp
db_path = r'C:\Users\timregan\Desktop\anondb_280718.sqlite3'
db = sqlite3.connect(db_path)
anevexp_df = filter_anevexp(db, 0)

在我的笔记本电脑(8GB内存)上,这个运行没有问题(尽管 filter_anevexp 需要几分钟)。在我的桌面(128GB的RAM)上,它以内存错误的方式失败:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "C:\Users\timregan\source\MentalHealth\code\preprocessing\feat_transform.py", line 171, in filter_anevexp
    anevexp_df = anevexp_df[anevexp_df["user_id"].isin(df)].copy()
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\frame.py", line 2682, in __getitem__
    return self._getitem_array(key)
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\frame.py", line 2724, in _getitem_array
    return self._take(indexer, axis=0)
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\generic.py", line 2789, in _take
    verify=True)
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\internals.py", line 4539, in take
    axis=axis, allow_dups=True)
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\internals.py", line 4425, in reindex_indexer
    for blk in self.blocks]
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\internals.py", line 4425, in <listcomp>
    for blk in self.blocks]
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\internals.py", line 1258, in take_nd
    allow_fill=True, fill_value=fill_value)
  File "C:\Users\timregan\AppData\Local\Programs\Python\Python37-32\lib\site-packages\pandas\core\algorithms.py", line 1655, in take_nd
    out = np.empty(out_shape, dtype=dtype)
MemoryError

在内存大的机器上,我需要做什么特别的事情来防止错误(例如寻址错误)?

注意:我没有把代码包括在 过滤器\u anevexp

1 回复 | 直到 6 年前

eljiwo 6 年前

您在家用pc中使用的是32位版本,这意味着您的python可执行文件只能访问4gb的ram。尝试用64位而不是当前使用的32位重新安装python37。

推荐文章

user29747013 · 如何创建一个新的数据框架,其中包含原始数据框架中列的聚合列?

6 月前

Cam · Pandas列表日期到日期时间

6 月前

jjkennedy · Pandas文本文件导入:当每个文件中存在多个表时,自动选择1个表

6 月前

Sun Jar · 在另一个系列中查找当前df值的索引,并将其添加到列中

7 月前

dietzi96 · Pandas DataFrame.to_sql随机和静默地失败,没有错误消息

7 月前

Bijan · Pandas批量更新帐户字符串

7 月前

Kernel · TypeError:Index.reindex()收到意外的关键字参数fill_value'

7 月前

Kernel · 进入熊猫的定义。系列super().reindex

7 月前

adventurous_chip_55 · 如何引爆柱子

7 月前

RKIDEV · Panda迭代行并将第n行值乘以下一(n+1)行值

7 月前