Pandas concat: ValueError: Shape of passed values is blah, indices imply blah2

小开

Aus_lacy's post gave me the idea of trying related methods, of which join does work:

In [196]:


hl.name = 'hl'
Out[196]:
'hl'
In [199]:


df.join(hl).head(4)
Out[199]:
                         high       low     loc_h     loc_l        hl
2014-01-01 17:00:00  1.376235  1.375945  1.376235  1.375945  1.376090
2014-01-01 17:01:00  1.376005  1.375775       NaN       NaN       NaN
2014-01-01 17:02:00  1.375795  1.375445       NaN  1.375445  1.375445
2014-01-01 17:03:00  1.375625  1.375515       NaN       NaN       NaN

Still, some insight into why concat works on the examples but not on this data would be nice!

小开

I had a similar problem (join worked, but concat failed).

Check for duplicate index values in df1 and s1 (e.g. df1.index.is_unique).

Removing the duplicate index values (e.g. df.drop_duplicates(inplace=True)) or using one of the methods at https://stackoverflow.com/a/34297689/7163376 should resolve it.
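
A minimal sketch of that check, using made-up data (the names df1 and s1 mirror the ones above; the values are hypothetical). One caveat worth knowing: DataFrame.drop_duplicates compares row values, not index labels, so it only helps when the rows sharing a label are also identical. Masking with index.duplicated() targets the labels themselves.

```python
import pandas as pd

# Hypothetical data: df1 carries the label 'a' twice, s1 has unique labels.
df1 = pd.DataFrame({"x": [1, 2, 3]}, index=["a", "a", "b"])
s1 = pd.Series([10, 20], index=["a", "b"], name="y")

print(df1.index.is_unique)  # False
print(s1.index.is_unique)   # True

# drop_duplicates() compares row values, so the repeated label survives here.
by_values = df1.drop_duplicates()
print(by_values.index.is_unique)  # False

# Keep only the first row for each index label.
df1_unique = df1[~df1.index.duplicated(keep="first")]
print(df1_unique.index.is_unique)  # True

# With unique labels on both sides, the column-wise concat goes through.
combined = pd.concat([df1_unique, s1], axis=1)
print(combined)
```

Which of the duplicated rows to keep (first, last, or an aggregate) is a decision about the data, so keep="first" here is only one possible choice.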

小开

My problem was that the indexes were different; the code below solved it.

df1.reset_index(drop=True, inplace=True)
df2.reset_index(drop=True, inplace=True)
df = pd.concat([df1, df2], axis=1)
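
To see why resetting helps, here is a small sketch with hypothetical frames (the column names and values are invented for illustration). pd.concat with axis=1 aligns rows by index label, so frames with unrelated labels do not line up until both share the same labels.

```python
import pandas as pd

# Hypothetical frames with the same number of rows but unrelated labels.
df1 = pd.DataFrame({"a": [1, 2, 3]}, index=[0, 1, 2])
df2 = pd.DataFrame({"b": [4, 5, 6]}, index=[10, 11, 12])

# concat(axis=1) aligns on labels, so nothing lines up: 6 rows, half NaN.
by_label = pd.concat([df1, df2], axis=1)
print(by_label.shape)  # (6, 2)

# After resetting, both frames share the labels 0, 1, 2 and align row by row.
by_position = pd.concat(
    [df1.reset_index(drop=True), df2.reset_index(drop=True)], axis=1
)
print(by_position.shape)  # (3, 2)
```

Note that reset_index(drop=True) throws the original labels away, so this is only appropriate when the rows of the two frames already correspond by position.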

小开

Your indexes probably contain duplicated values.

import pandas as pd


T1_INDEX = [
    0,
    1,  # <= !!! if I write e.g. "0" here, then it fails
    0.2,
]
T1_COLUMNS = ['A', 'B', 'C', 'D']
T1 = [
    [1.0, 1.1, 1.2, 1.3],
    [2.0, 2.1, 2.2, 2.3],
    [3.0, 3.1, 3.2, 3.3],
]

T2_INDEX = [
    1.2,
    2.11,
]
T2_COLUMNS = ['D', 'E', 'F']
T2 = [
    [54.0, 5324.1, 3234.2],
    [55.0, 14.5324, 2324.2],
    # [3.0, 3.1, 3.2],
]

df1 = pd.DataFrame(T1, columns=T1_COLUMNS, index=T1_INDEX)
df2 = pd.DataFrame(T2, columns=T2_COLUMNS, index=T2_INDEX)

print(pd.concat([pd.DataFrame({})] + [df2, df1], axis=1))
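
The effect of that one index entry can be checked side by side. The sketch below uses smaller, hypothetical frames and a helper named try_concat that is not part of the original answer. The exact exception class and message depend on the pandas version, so the helper catches Exception broadly rather than assuming a particular one.

```python
import pandas as pd


def try_concat(index):
    """Concatenate two small frames column-wise; return the result or the error."""
    left = pd.DataFrame({"A": [1.0, 2.0, 3.0]}, index=index)
    right = pd.DataFrame({"E": [54.0, 55.0]}, index=[1.2, 2.11])
    try:
        return pd.concat([left, right], axis=1)
    except Exception as exc:  # exact class and message vary by pandas version
        return exc


ok = try_concat([0, 1, 0.2])   # unique labels: concat succeeds
bad = try_concat([0, 0, 0.2])  # label 0 appears twice: concat raises

print(type(ok).__name__, ok.shape)
print(type(bad).__name__, bad)
```

The two frames have different indexes, so concat has to reindex both onto the combined labels, and that step is where a repeated label gets in the way.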

小开

Try sorting the indexes after concatenating them:

result=pd.concat([df1,df2]).sort_index()
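
For reference, a tiny sketch of what that line does, with invented data. Note that it stacks rows (the default axis=0) rather than joining columns, and whether it helps with the error depends on the data at hand.

```python
import pandas as pd

# Hypothetical frames whose row labels are out of order.
df1 = pd.DataFrame({"v": [20, 0]}, index=[2, 0])
df2 = pd.DataFrame({"v": [30, 10]}, index=[3, 1])

# Row-wise concat keeps the labels in the order they arrive.
stacked = pd.concat([df1, df2])
print(list(stacked.index))  # [2, 0, 3, 1]

# sort_index() reorders the rows by label.
result = stacked.sort_index()
print(list(result.index))   # [0, 1, 2, 3]
```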

小开

To drop duplicate indices, use df = df.loc[df.index.drop_duplicates()]. C.f. pandas.pydata.org/pandas-docs/stable/generated/. - BallpointBen, Apr 18 at 15:25

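This is wrong, but I can't reply directly to BallpointBen's comment because of low reputation. The reason it's wrong is that df.index.drop_duplicates() returns a list of unique indices, but when you index back into the dataframe with those unique indices, it still returns all records. I think this is because indexing with one of the duplicated indices returns every instance of that index.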

Instead, use df.index.duplicated(), which returns a boolean list (add the ~ to get the non-duplicated records):

df = df.loc[~df.index.duplicated()]
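
The difference between the two approaches shows up on a three-row frame. This is a sketch with invented data; the variable names via_labels and via_mask are only for illustration.

```python
import pandas as pd

# Hypothetical frame where the label 'a' appears twice.
df = pd.DataFrame({"v": [1, 2, 3]}, index=["a", "a", "b"])

# Selecting by the unique labels still returns every row carrying each label.
via_labels = df.loc[df.index.drop_duplicates()]
print(len(via_labels))  # 3

# The boolean mask keeps only the first row for each label.
via_mask = df.loc[~df.index.duplicated()]
print(len(via_mask))           # 2
print(via_mask["v"].tolist())  # [1, 3]
```

duplicated() marks every occurrence after the first by default; passing keep='last' would retain the last row per label instead.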

小开

Maybe it is simple: try this if you have a DataFrame. Make sure that the matrices or vectors you are trying to combine have the same row names/index.

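I had the same issue. I changed the row name indices so that they matched each other. Here is an example of a matrix (principal components) and a vector (target) that have the same row indices (I circled them in blue on the left side of the picture).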

Before, "when it was not working", I had the matrix with normal row indices (0,1,2,3) while the vector had row indices (ID0, ID1, ID2, ID3). Then I changed the vector's row indices to (0,1,2,3), and that worked for me.
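
A rough sketch of that fix, assuming a 4x2 matrix of principal components and a 4-element target vector. The names pcs and target and all the values are hypothetical stand-ins for the data in the picture.

```python
import numpy as np
import pandas as pd

# Hypothetical principal-component matrix with default row labels 0..3.
pcs = pd.DataFrame(np.arange(8.0).reshape(4, 2), columns=["PC1", "PC2"])

# Hypothetical target vector labelled ID0..ID3.
target = pd.Series([0, 1, 0, 1], index=["ID0", "ID1", "ID2", "ID3"], name="target")

# Give the vector the matrix's row labels.
# This assumes the rows already correspond to each other by position.
target.index = pcs.index

# With matching labels, the column-wise concat pairs the rows one to one.
aligned = pd.concat([pcs, target], axis=1)
print(aligned.shape)  # (4, 3)
print(aligned)
```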

[image: enter image description here]