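Can anyone help with mrjob and pymorphy2? I'm new to Python and Hadoop. I roughly understand how to tokenize text, but I don't understand how to use pymorphy2 to morphologically analyze the resulting tokens. Maybe I'm doing something wrong, but I can't see what. Here is my code: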
from mrjob.job import MRJob
import re, pymorphy2

morph = pymorphy2.MorphAnalyzer()
WORD_RE = re.compile(r"[\w']+")

class MRMorphWord(MRJob):
    def mapper(self, _, line):
        for word in WORD_RE.findall(line):
            yield (word.lower(), 1)

    def reducer(self, _, word):
        for i in word:
            p = morph.parse(word)[0]
            yield p

if __name__ == '__main__':
    MRMorphWord.run()
Here is the error message:
parse
word_lower = word.lower()
AttributeError: 'generator' object has no attribute 'lower'
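The traceback suggests what is going on: an mrjob reducer is called with a key and an iterator over all values emitted for that key. In the code above the key (the actual word) is discarded as `_`, and the parameter named `word` is the generator of 1s, so `morph.parse(word)` ends up calling `.lower()` on a generator. The reducer also yields a bare parse result, whereas mrjob expects key/value pairs. Below is a minimal sketch of one possible fix: do the morphological analysis in the mapper, where each token really is a string, and only sum counts in the reducer. So that it runs without Hadoop or pymorphy2 installed, `FakeMorph`, `FakeParse`, and `run_locally` are hypothetical stand-ins invented for this illustration; they are not part of either library. The only pymorphy2 behaviour assumed is that `parse(word)` returns a list whose items have a `normal_form` attribute.

```python
import re
from collections import defaultdict, namedtuple

WORD_RE = re.compile(r"[\w']+")

# Hypothetical stand-in for pymorphy2.MorphAnalyzer, so the sketch runs
# without the library. It only mimics parse(word) -> [obj with .normal_form].
FakeParse = namedtuple("FakeParse", ["word", "normal_form"])


class FakeMorph:
    LEMMAS = {"кошки": "кошка", "кошку": "кошка", "бежали": "бежать"}

    def parse(self, word):
        return [FakeParse(word, self.LEMMAS.get(word, word))]


morph = FakeMorph()  # in the real job: pymorphy2.MorphAnalyzer()


def mapper(_, line):
    # Analyze each token here, where `word` is a plain string.
    for word in WORD_RE.findall(line):
        lemma = morph.parse(word.lower())[0].normal_form
        yield lemma, 1


def reducer(lemma, counts):
    # `lemma` is the key (a str); `counts` is an iterator over the 1s.
    yield lemma, sum(counts)


def run_locally(lines):
    # Tiny in-memory imitation of the shuffle between mapper and reducer.
    grouped = defaultdict(list)
    for line in lines:
        for key, value in mapper(None, line):
            grouped[key].append(value)
    result = {}
    for key in sorted(grouped):
        for out_key, out_value in reducer(key, iter(grouped[key])):
            result[out_key] = out_value
    return result


counts = run_locally(["Кошки бежали", "кошку"])
print(counts)
```

In the actual job these two functions would become the `mapper(self, _, line)` and `reducer(self, lemma, counts)` methods of the `MRJob` subclass, with `morph = pymorphy2.MorphAnalyzer()` in place of the stand-in. Emitting the normal form as a string also keeps the output serializable, which a full parse object may not be.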