Featuretools提供了处理分类变量的集成功能
variable_types={"product_id": ft.variable_types.Categorical} https://docs.featuretools.com/loading_data/using_entitysets.html
但是这些应该是
strings
或
pandas.Category
与Featuretools实现最佳兼容性的类型?
编辑
另外,是否需要像中那样手动指定所有列
https://github.com/Featuretools/predict-appointment-noshow/blob/master/Tutorial.ipynb
import featuretools.variable_types as vtypes
variable_types = {'gender': vtypes.Categorical,
'patient_id': vtypes.Categorical,
'age': vtypes.Ordinal,
'scholarship': vtypes.Boolean,
'hypertension': vtypes.Boolean,
'diabetes': vtypes.Boolean,
'alcoholism': vtypes.Boolean,
'handicap': vtypes.Boolean,
'no_show': vtypes.Boolean,
'sms_received': vtypes.Boolean}