对大家有帮助的问答会被标记为“推荐”,看完课程过来浏览一下别人提的问题,会帮你学得更全面
import scrapy from scrapy_splash.request import SplashRequest class BaiduSpider(scrapy.Spider): name = 'baidu' allowed_domains = ['baidu.com'] start_urls = ['http://www.baidu.com/'] def start_requests(self): for url in self.start_urls: yield SplashRequest(url) def parse(self, response): print(response.text)
运行时报错
老师,运行docker run -p 8050:8050 scrapinghub/splash
命令时成功,浏览器访问url报错
老师,为啥有些是这样的符号?如何解决
<title>ç¾åº¦å®å¨éªè¯</title>
老师,我这个图片的url复制到网站报403错误,代码报错如图1,图片url复制到网站为图2,检测网站图3,是网站不运行进行爬虫吗?
index_url ='https://www.kuaidaili.com/usercenter/overview' index_req = Request(index_url,headers =headers) index_resp = opener.open(index_req) print(index_resp.read().decode())
请问这里是什么意思,在登录账号后,还发送请求是为啥