通过regex仅保留HTML标记之外[重复]

rdrgkggo 于 12个月前发布在其他

关注(0)|答案(1)|浏览(102)

此问题在此处已有答案：

RegEx match open tags except XHTML self-contained tags（36个回答）
4天前关闭。
我有一个列表如下：

<td class="News"><a href="ubuntu">Ubuntu</a></td>
  <td class="News" style="text-align: right" title="Yesterday: 2578">2571<img src="/web/20061130064026im_/http://distrowatch.com/images/other/adown.png" alt="<" title="Yesterday: 2578"></td>
  <td class="News"><a href="suse">openSUSE</a></td>
  <td class="News" style="text-align: right" title="Yesterday: 1943">1943<img src="/web/20061130064026im_/http://distrowatch.com/images/other/alevel.png" alt="=" title="Yesterday: 1943"></td>
  <td class="News"><a href="fedora">Fedora</a></td>
  <td class="News" style="text-align: right" title="Yesterday: 1420">1422<img src="/web/20061130064026im_/http://distrowatch.com/images/other/aup.png" alt=">" title="Yesterday: 1420"></td>
  <td class="News"><a href="mepis">MEPIS</a></td>

字符串
现在我想只保留Notepad++中的HTML标记或>*****<之外的内容
例如，在上面的列表中，只有以下内容必须保留，其他内容必须删除：

Ubuntu
2571
openSUSE
1943
Fedora
1422
MEPIS

型
我试着遵循regex，但它不准确，并保留额外的代码：

>([^<>]+)<

型
我的正则表达式问题在哪里？

regex

来源：https://stackoverflow.com/questions/77463941/keep-only-outside-of-html-tags-by-regex

1条答案

按热度按时间

hfwmuf9z1#

得到你例子中提到的结果。

查找内容：（？：<[\s\w="-：]+>\s*）+（\w+）.+
**替换为：**1美元

使用此屏幕中显示的选项。x1c 0d1x

赞(0）回复(0）举报 12个月前

我来回答

通过regex仅保留HTML标记之外[重复]

1条答案

相关问题

热门标签

最新问答