Scrapy shell returns an empty list

s4chpxco posted on 2022-11-09 in Shell

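I am trying to scrape this site, https://gol.gg/game/stats/43023/page-game/, using the Scrapy shell, but it always returns an empty list or nothing at all. I have tried response.css('').get() and response.xpath('').extract(), with selectors taken from SelectorGadget or from the page source, but I cannot scrape anything. Is this site not scrapable, or am I doing something badly wrong? I am new to web scraping.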

xoefb8l8 #1

response.xpath('').extract() should fail, because an empty string is not a valid XPath expression.
It works fine for me. Steps to reproduce:

scrapy shell https://gol.gg/game/stats/43023/page-game/

response.xpath('//*')

Output:

[<Selector xpath='//*' data='<html lang="en">\n<head>\n             ...'>,
 <Selector xpath='//*' data='<head>\n                              ...'>,
 <Selector xpath='//*' data='<script>var gameid=43023;</script>'>,
 <Selector xpath='//*' data='<title>BLG vs OMG game 1 - LPL Summer...'>,
 <Selector xpath='//*' data='<base href="https://gol.gg/game/">'>,
 <Selector xpath='//*' data='<meta name="description" content="Vie...'>,
 <Selector xpath='//*' data='<meta name="keywords" content="lol, l...'>,
 <Selector xpath='//*' data='<meta name="twitter:card" content="su...'>,
 <Selector xpath='//*' data='<meta name="twitter:site" content="@b...'>,
 <Selector xpath='//*' data='<meta name="twitter:title" content="B...'>,
 <Selector xpath='//*' data='<meta name="twitter:description" cont...'>,
 <Selector xpath='//*' data='<meta name="twitter:image" content="h...'>,
 <Selector xpath='//*' data='<link rel="shortcut icon" href="../fa...'>,
 <Selector xpath='//*' data='<link rel="stylesheet" href="https://...'>,
 <Selector xpath='//*' data='<script src="https://code.jquery.com/...'>,
 <Selector xpath='//*' data='<script src="https://cdnjs.cloudflare...'>,
 <Selector xpath='//*' data='<script src="https://stackpath.bootst...'>,
 <Selector xpath='//*' data='<link href="https://fonts.googleapis....'>,
 <Selector xpath='//*' data='<link rel="stylesheet" type="text/css...'>,
 <Selector xpath='//*' data='<meta name="viewport" content="width=...'>,
 <Selector xpath='//*' data='<script>(function(w,d,s,l,i){w[l]=w[l...'>,
 <Selector xpath='//*' data='<body>\n<!-- Google Tag Manager (noscr...'>,
 <Selector xpath='//*' data='<noscript><iframe src="https://www.go...'>,
 <Selector xpath='//*' data='<iframe src="https://www.googletagman...'>,
 <Selector xpath='//*' data='<link href="../_lib/selectize/css/sel...'>,
 <Selector xpath='//*' data='<script src="https://ajax.googleapis....'>,
 <Selector xpath='//*' data='<script src="../_lib/selectize/js/sta...'>,
 <Selector xpath='//*' data='<link rel="stylesheet" href="https://...'>,
 <Selector xpath='//*' data='<div class="container-fluid main">\n  ...'>,
 <Selector xpath='//*' data='<header>\n   <div class="row row-fluid...'>,
 <Selector xpath='//*' data='<div class="row row-fluid">\n        \n...'>,
 <Selector xpath='//*' data='<div class="col-12 no-padding">\n   <n...'>,
 <Selector xpath='//*' data='<nav class="navbar navbar-dark menu">...'>,
 <Selector xpath='//*' data='<div class="pull-left hidden-xs">\n   ...'>,
 <Selector xpath='//*' data='<div class="pull-left">\n             ...'>,
 <Selector xpath='//*' data='<a href="../esports/home/"><img src="...'>,
 <Selector xpath='//*' data='<img src="../_img/gol_esports.png" cl...'>,
 <Selector xpath='//*' data='<a href="../esports/home/"><img src="...'>,
 <Selector xpath='//*' data='<img src="../_img/gol_esports_mini.pn...'>,
'...']
