# xhs_playwright **Repository Path**: wanjiasu/xhs_playwright ## Basic Information - **Project Name**: xhs_playwright - **Description**: 小红书playwright爬虫 - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 1 - **Forks**: 0 - **Created**: 2025-03-12 - **Last Updated**: 2025-12-10 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # 小红书 Playwright 爬虫 基于 Playwright 的小红书数据采集工具,支持搜索下拉词、博主信息和笔记内容的自动化获取。 ## 功能特点 - 搜索下拉词采集 - 博主个人信息获取 - 笔记内容和互动数据采集 - 自动化登录状态管理 ## 安装步骤 1. 克隆仓库 ```bash git clone https://gitee.com/wanjiasu/xhs_playwright.git cd xhs_playwright ``` 2. 虚拟环境 ```bash python -m venv env ``` windows ```bash ./env/Scripts/activate.bat ``` linux ```bash source env/bin/activate ``` 3. 安装依赖 ```bash python3 -m pip install -r requirements.txt ``` 4. 初始化登录状态 ```bash python first.py ``` ## 使用方法 ### 搜索下拉词获取 ```bash python xhs_search_dropdown/xhs_search_dropdown.py ``` ### 博主信息获取 ```bash python xhs_blogger_info/xhs_single_blogger_info_get.py ``` ### 笔记信息获取 ```bash python xhs_note_info/xhs_single_note_info_get.py ``` ### 笔记评论获取 ```bash python xhs_note_info/xhs_single_note_comments_get.py ``` ## 注意事项 - 请遵守小红书的使用条款和robots协议 - 避免频繁请求,防止IP被封 - 仅用于学习和研究目的