我想在下面的数据框中填充一个范围内的日期,并填充所有列。当我完成此操作时,我想在“威尔斯”列中追加列表,以便它继续按日期添加项目。
按日期展开的数据框
StartDate Wells count Sum_Cumm vol
0 1967-10-01 [MUN-523, MUN-354, MUN-2660] 50 50 8503.323620
1 1968-01-01 [MUN-152] 1 51 8336.591784
2 1968-03-01 [MUN-1032] 1 52 8176.272712
3 1968-10-01 [MUN-16128] 1 53 9191.110200
我正在编写的代码
newdf = (newdf.set_index('StartDate').reindex(pd.date_range('10-01-1967', '12-31-1994', freq='MS')).rename_axis(['StartDate']).reset_index()).ffill(newdf['vol'])
我希望最终使用的数据框架
StartDate Wells count Sum_Cumm vol
0 1967-10-01 [MUN-523, MUN-354, MUN-2660] 50 50 8503.323620
1 1967-11-01 [MUN-523, MUN-354, MUN-2660] 1 51 8503.323620
2 1967-12-01 [MUN-523, MUN-354, MUN-2660] 1 51 8503.323620
3 1968-01-01 [MUN-523, MUN-354, MUN-2660,MUN-152] 1 52 8336.591784
4 1968-02-01 [MUN-523, MUN-354, MUN-2660,MUN-152] 1 53 8336.591784
5 1968-03-01 [MUN-523, MUN-354, MUN-2660,MUN-152,MUN-1032] 1 53 8176.272712
6 1968-04-01 [MUN-523, MUN-354, MUN-2660,MUN-152,MUN-1032] 1 53 8176.272712
1条答案
按热度按时间c90pui9n1#
您可以使用
period_range
创建一个新的索引,并重新索引现有的df以创建一个新的 Dataframe ,然后填充。对于
Wells
列,执行cumsum
,然后应用np.unique