
What is the <U12 dtype?

types numpy pandas arrays python

ipramusinto · 6 years ago

I am using pandas and numpy, and I have these two arrays:

array(['french', 'mexican', 'cajun_creole', ..., 'southern_us', 'italian',
       'thai'], dtype='<U12')

array(['french', 'mexican', 'cajun_creole', ..., 'jamaican', 'italian',
       'thai'], dtype=object)

I can't see any difference between them. What is <U12?

2 answers

Stephen Rauch, Afsar Ali · 6 years ago

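<U12 is a numpy dtype string made up of three parts:

< : little-endian byte order

U : Unicode string

12 : at most 12 characters per element

(Source)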
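
As a minimal sketch of how these parts show up on the dtype object itself (the variable names are illustrative and not from the question), NumPy infers the width from the longest string in the input:

```python
import numpy as np

cuisines = ['french', 'mexican', 'cajun_creole', 'southern_us', 'italian', 'thai']
arr = np.array(cuisines)  # NumPy infers a fixed-width Unicode dtype

longest = max(len(s) for s in cuisines)  # 'cajun_creole' is the longest entry

print(arr.dtype.kind)      # 'U' marks a Unicode string dtype
print(arr.dtype.itemsize)  # bytes per element: 4 bytes per character
print(arr.dtype == np.dtype(f'U{longest}'))  # width equals the longest string
```

The leading < only appears in the printed dtype on little-endian machines; on other platforms the byte-order character may differ.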

Paul Panzer · 6 years ago

The difference lies in how the elements are stored.

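<U12 stores the strings flat in the array's own buffer, with every entry padded to a length of 12 characters. We can use tobytes to access the data buffer directly: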

>>> au
array(['french', 'mexican', 'cajun_creole', 'Ellipsis', 'southern_us',
       'italian', 'thai'], dtype='<U12')
>>> 
>>> sz = au.dtype.itemsize
>>> [au.tobytes()[i:i+sz].decode('utf32') for i in range(0, au.size * sz, sz)]
['french\x00\x00\x00\x00\x00\x00', 'mexican\x00\x00\x00\x00\x00', 'cajun_creole', 'Ellipsis\x00\x00\x00\x00', 'southern_us\x00', 'italian\x00\x00\x00\x00\x00', 'thai\x00\x00\x00\x00\x00\x00\x00\x00']
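
One practical consequence of this flat layout, sketched here with a hypothetical replacement value that is not from the original answer, is that every slot costs the same number of bytes and a longer string assigned later is silently truncated to fit:

```python
import numpy as np

au = np.array(['french', 'cajun_creole', 'thai'])  # inferred as a 12-character Unicode dtype

# Every slot occupies the same number of bytes, whatever the string's length.
slot_bytes = au.dtype.itemsize  # 12 characters * 4 bytes each
total_bytes = au.nbytes         # number of elements * slot_bytes

# Assigning a longer string does not grow the slot; NumPy truncates it.
long_name = 'a_very_long_cuisine_name'  # hypothetical value, longer than 12 characters
au[2] = long_name
truncated = str(au[2])

print(slot_bytes, total_bytes, truncated)
```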

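The object dtype, by contrast, stores references to str objects. We can verify this using the fact that, in the current CPython implementation, id returns the memory address of a Python object: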

>>> ao
array(['french', 'mexican', 'cajun_creole', Ellipsis, 'southern_us',
       'italian', 'thai'], dtype=object)
>>> 
>>> sz = ao.dtype.itemsize
>>> [int.from_bytes(ao.tobytes()[i:i+sz], 'little') for i in range(0, ao.size * sz, sz)]
[140626141129896, 140625895652128, 140625895628080, 8856512, 140625895627504, 140626141132200, 140626343518024]
>>> [id(it) for it in ao]
[140626141129896, 140625895652128, 140625895628080, 8856512, 140625895627504, 140626141132200, 140626343518024]
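
The two representations can be converted into each other. The following is a rough sketch under the assumption that all elements are plain strings; the names words and long_name are made up for illustration:

```python
import numpy as np

words = ['french', 'mexican', 'cajun_creole']

au = np.array(words)                # fixed-width Unicode
ao = np.array(words, dtype=object)  # array of references to Python str objects

# An object slot holds only a reference, so a longer string is never truncated.
long_name = 'a_very_long_cuisine_name'  # hypothetical value
ao[0] = long_name
kept_whole = ao[0] == long_name

# Converting between the two representations.
as_object = au.astype(object)  # elements become Python str objects
as_fixed = ao.astype(str)      # width is re-inferred from the longest element

print(kept_whole, type(as_object[0]), as_fixed.dtype)
```

The object array trades the compact, fixed-size layout for flexibility: it can hold strings of any length, at the cost of one extra indirection per element.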
