Moving a specific word within a sentence in a pandas DataFrame

Posted by eqoofvh9 on 2022-12-09 in Other


def order_word(s, word, delta):
    words = s.split()
    oldpos = words.index(word)
    words.insert(oldpos+delta, words.pop(oldpos))
    return ' '.join(words)

Can someone help me build the code? Thanks in advance.
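For reference, here is a minimal sketch of how the helper above behaves on one string and how it might be applied to a column. The wrapper `move_to_end`, the target words, and the sample rows are assumptions for illustration (the words are borrowed from the answers below), not part of the original question.

```python
import pandas as pd

def order_word(s, word, delta):
    words = s.split()
    oldpos = words.index(word)  # raises ValueError if word is not present
    words.insert(oldpos + delta, words.pop(oldpos))
    return " ".join(words)

def move_to_end(s, targets=("pt", "cv", "bp")):
    # Hypothetical wrapper: move the first target word found to the end.
    words = s.split()
    for t in targets:
        if t in words:
            # A delta this large always pushes the insert index past the
            # end of the list, and list.insert then appends.
            return order_word(s, t, len(words))
    return s  # no target word: leave the string unchanged

df = pd.DataFrame({"Column A": ["pt abcdefg", "abcdg pt", "no match"]})
df["Column A"] = df["Column A"].apply(move_to_end)
print(df)
```

Guarding with `if t in words` avoids the `ValueError` that `list.index` raises when the word is missing from a row.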

pandas

Source: https://stackoverflow.com/questions/74684951/moving-specific-word-within-sentence-in-a-pandas-dataframe

3 Answers


blpfk2vs1#

Here is a proposal using pandas.Series.str.split and sorted:

df["Column A"] = (
    df["Column A"]
    .str.split()
    .apply(lambda x: " ".join(sorted(x, key=len, reverse=True)))
)

# Output:

print(df)
     Column A
0  abcdefg pt
1   fghikl cv
2    abcdg pt
3    opqrs cv
4   ststst bp
5    qwert bp
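One caveat worth checking: sorting by length reorders every word, and Python's sort is stable, so the approach relies on the code being strictly shorter than the other words. A small sketch with made-up rows (the second and third rows are assumptions, not from the question) shows where it stops moving the code:

```python
import pandas as pd

s = pd.Series(["pt abcdefg", "aaaa pt cc", "bp xy"])

# Same expression as above, applied to a standalone Series.
by_length = s.str.split().apply(
    lambda x: " ".join(sorted(x, key=len, reverse=True))
)

# Words of equal length keep their original order, so "pt" stays
# before "cc" and "bp" stays before "xy".
print(by_length.tolist())
```

If rows like these can occur, matching the codes explicitly, as in the regex answers, is likely the safer choice.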


ctzwtxfj2#

You can use a regex with str.replace:

df['Column A'] = df['Column A'].str.replace(r'\s*\b(cv|pt|bp)\b\s*(.*$)',
                                            r'\2 \1', regex=True)

Output (as a new column, for clarity):

     Column A    Column B
0  pt abcdefg  abcdefg pt
1   cv fghikl   fghikl cv
2    abcdg pt    abcdg pt
3    opqrs cv    opqrs cv
4   bp ststst   ststst bp
5    qwert bp    qwert bp
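A self-contained sketch that reproduces the table above may help when trying the pattern out. The DataFrame contents are taken from the sample shown in this thread; the extra `middle` Series is an assumption added to show what happens when a code sits between two words.

```python
import pandas as pd

df = pd.DataFrame({"Column A": ["pt abcdefg", "cv fghikl", "abcdg pt",
                                "opqrs cv", "bp ststst", "qwert bp"]})

# \s*\b(cv|pt|bp)\b  one of the codes as a whole word, plus any space before it
# \s*(.*$)           everything after the code up to the end of the string
pattern = r'\s*\b(cv|pt|bp)\b\s*(.*$)'
df["Column B"] = df["Column A"].str.replace(pattern, r'\2 \1', regex=True)
print(df)

# Caveat (assumed input): the leading \s* consumes the space before a code
# in the middle, so the neighbouring words are glued together.
middle = pd.Series(["aaaa pt cc"]).str.replace(pattern, r'\2 \1', regex=True)
print(middle.tolist())  # ['aaaacc pt']
```

For data where the code only appears at the start or the end, as in the table above, the pattern behaves as shown.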



rur96b6h3#

Example

import pandas as pd

data = {'Column A': {0: 'pt abcdefg',1: 'cv fghikl',2: 'abcdg pt',3: 'opqrs cv',4: 'bp ststst',5: 'qwert bp',6: 'aaaa pt cc'}}
df = pd.DataFrame(data)

df:

Column A
0   pt abcdefg
1   cv fghikl
2   abcdg pt
3   opqrs cv
4   bp ststst
5   qwert bp
6   aaaa pt cc

Code

s = (df['Column A'].str.replace(r'(.*)(pt|cv|bp)(.*)', r'\1 \3 \2', regex=True)
     .str.replace(r'(\s)+', r'\1', regex=True)
     .str.strip())  # remove the leading space left when the code starts the string

Output (s):

0    abcdefg pt
1     fghikl cv
2      abcdg pt
3      opqrs cv
4     ststst bp
5      qwert bp
6    aaaa cc pt
dtype: object

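Assign s back to the column.
I wrote this code to handle the case where pt or cv appears in the middle.

Update

The asker said the middle case does not occur.
In that case, use the following code: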

df['Column A'].str.replace(r'^(pt|cv|bp)[ ](.+)', r'\2 \1', regex=True)
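Note that `str.replace` returns a new Series, so the result has to be assigned back for the column to change. A minimal sketch, using three rows from the example data above:

```python
import pandas as pd

df = pd.DataFrame({"Column A": ["pt abcdefg", "abcdg pt", "aaaa pt cc"]})

# ^ anchors the match at the start, so only a leading code followed by a
# single space is moved; rows with the code elsewhere are left untouched.
df["Column A"] = df["Column A"].str.replace(
    r'^(pt|cv|bp)[ ](.+)', r'\2 \1', regex=True
)
print(df)
```

Because of the `^` anchor, the row with the code in the middle ("aaaa pt cc") is returned unchanged.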
