regex 在Python中使用正则表达式将文本拆分为speaker和text？

n1bvdmb6 于 2023-08-08 发布在 Python

关注(0)|答案(1)|浏览(76)

我有以下字符串：

JOHN SMITH, GLOBAL HEAD OF YOUTUBE : Good morning, good 
afternoon, everyone . Before I hand over to facebook, I want to give a quick reminder of the reporting 
changes that have taken effect this filming of a tv show.  
 

 
BOBBY DUDE, GROUP FROM FACEBOOK:     Thanks, john smith lets talk about movies and films we watch when we are bored parents.

字符串
我如何创建一个正则表达式模式来将文本拆分为speaker和text？例如，为了得到这个结果：

string1: (speaker = JOHN SMITH, GLOBAL HEAD OF YOUTUBE, text  =  Good morning, good 
afternoon, everyone . Before I hand over to facebook, I want to give a quick reminder of the reporting 
changes that have taken effect this filming of a tv show.  
 )

型
等等

regex

来源：https://stackoverflow.com/questions/76756499/split-text-into-speaker-and-text-with-regex-in-python

1条答案

按热度按时间

xmakbtuz1#

你可以试试（regex101）：

import re

text = """\
JOHN SMITH, GLOBAL HEAD OF YOUTUBE : Good morning, good
afternoon, everyone . Before I hand over to facebook, I want to give a quick reminder of the reporting
changes that have taken effect this filming of a tv show.


BOBBY DUDE, GROUP FROM FACEBOOK:     Thanks, john smith lets talk about movies and films we watch when we are bored parents. """

out = re.findall(
    r"^([^a-z:]+?)\s*:\s*(.*?)\s*(?=^[^a-z:]+?:|\Z)", text, flags=re.S | re.M
)

print(out)

字符串
印刷品：

[
    (
        "JOHN SMITH, GLOBAL HEAD OF YOUTUBE",
        "Good morning, good\nafternoon, everyone . Before I hand over to facebook, I want to give a quick reminder of the reporting\nchanges that have taken effect this filming of a tv show.",
    ),
    (
        "BOBBY DUDE, GROUP FROM FACEBOOK",
        "Thanks, john smith lets talk about movies and films we watch when we are bored parents.",
    ),
]

型

赞(0）回复(0）举报 2023-08-08

我来回答

regex 在Python中使用正则表达式将文本拆分为speaker和text？

1条答案

相关问题

热门标签

最新问答