# zhihu

**Repository Path**: 2779626653/zhihu

## Basic Information

- **Project Name**: zhihu
- **Description**: Crawls Q&A content
- **Primary Language**: Python
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2017-07-21
- **Last Updated**: 2020-12-18

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# zhihu

Crawls Q&A content.

    pip install fake-useragent

http://www.cnblogs.com/jinxiao-pu/category/984395.html

    # Disable cookies; this can reduce the chance of being identified as a crawler
    COOKIES_ENABLED = False
    # Enable automatic download-delay throttling (AutoThrottle)
    AUTOTHROTTLE_ENABLED = True
    # Set the download delay, in seconds
    DOWNLOAD_DELAY = 10
    # Debug option: print the throttling delay
    AUTOTHROTTLE_DEBUG = False

scrapy

    # Enable cookies in the spider
    # Login is involved, so cookies must be enabled
    custom_settings = {
        "COOKIES_ENABLED": True
    }
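
The settings in this README work on two levels: project-wide values that disable cookies, and a per-spider `custom_settings` override that turns them back on for login. The sketch below only models that precedence with plain dictionaries and does not import Scrapy. It assumes Scrapy's documented rule that a spider's `custom_settings` take priority over the project's `settings.py`. The helper `effective_settings` and the variable names are hypothetical, introduced for illustration only.

```python
# Project-wide values, mirroring the settings.py entries listed in this README.
PROJECT_SETTINGS = {
    "COOKIES_ENABLED": False,
    "AUTOTHROTTLE_ENABLED": True,
    "DOWNLOAD_DELAY": 10,  # seconds
    "AUTOTHROTTLE_DEBUG": False,
}

# Per-spider override, mirroring the custom_settings entry in this README.
LOGIN_SPIDER_CUSTOM_SETTINGS = {"COOKIES_ENABLED": True}


def effective_settings(project, custom):
    """Hypothetical helper: return the settings a spider would end up with.

    Models the rule that per-spider custom_settings override project
    settings. It is an illustration, not part of Scrapy's API.
    """
    merged = dict(project)   # start from the project-wide defaults
    merged.update(custom)    # spider-level values win on conflict
    return merged


login_settings = effective_settings(PROJECT_SETTINGS, LOGIN_SPIDER_CUSTOM_SETTINGS)
other_settings = effective_settings(PROJECT_SETTINGS, {})

print(login_settings["COOKIES_ENABLED"])  # True: the login spider keeps cookies
print(other_settings["COOKIES_ENABLED"])  # False: other spiders stay cookie-free
```

Under these assumptions, only the spider that logs in carries cookies, while every spider still inherits the 10-second `DOWNLOAD_DELAY` and the AutoThrottle flags.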