
GSAM / Python Crawler

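This repository does not declare an open-source license file (LICENSE). Before use, check the project description and the upstream dependencies of its code.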
爬取酷狗音乐.py 1.17 KB
import requests
from selenium import webdriver
from pyquery import PyQuery as pq
# Extract the song title from the page HTML
def Access_to_the_song(C):
    doc = pq(C)
    The_song = doc('.audioName').text()
    return The_song
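# Open the player page in headless Chrome, scrape its HTML, and download the audio
def Browser_access(URL):
    The_browser = webdriver.ChromeOptions()
    # set_headless() was deprecated and is gone from newer Selenium releases;
    # passing the Chrome flag directly works across versions
    The_browser.add_argument('--headless')
    Drier = webdriver.Chrome(options=The_browser)
    try:
        Drier.get(URL)
        Print_source = Drier.page_source
    finally:
        # Close the browser so that one Chrome process per song does not pile up
        Drier.quit()
    doc = pq(Print_source)
    Music_links = doc('.music').items()
    for i in Music_links:
        Music_downloads = i.attr('src')
        # Skip elements whose src attribute is missing or empty
        if not Music_downloads:
            continue
        with open(Access_to_the_song(Print_source) + '.mp3', 'wb') as p:
            p.write(requests.get(Music_downloads).content)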
# Scrape the hot-songs chart and visit each song's player page
def Music_links():
    URL = 'http://www.kugou.com/yy/rank/home/1-6666.html?from=rank'
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36'
    }
    response = requests.get(URL, headers=headers).text
    doc = pq(response)
    test = doc('.pc_temp_songname').items()
    for i in test:
        c = i.attr('href')
        # Skip entries that carry no link to a player page
        if c:
            Browser_access(c)


Music_links()
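
The script uses the scraped song title directly as a file name (`Access_to_the_song(Print_source) + '.mp3'`), so a title containing a character such as `/` or `?` can make `open` fail. Below is a minimal sketch of one way to guard against that. `safe_filename` and its `fallback` parameter are hypothetical names, not part of the original script, and the set of replaced characters is an assumption based on the characters Windows rejects in file names.

```python
import re

# Hypothetical helper, not part of the original script.
# Assumption: replace the characters Windows rejects in file names
# (this also covers "/" on POSIX systems) plus ASCII control characters.
_INVALID_CHARS = re.compile(r'[\\/:*?"<>|\x00-\x1f]')


def safe_filename(title, fallback="untitled"):
    """Return a version of `title` that is safe to use as a file name."""
    # Replace each invalid character with an underscore, then trim
    # leading/trailing spaces and dots, which Windows also rejects.
    cleaned = _INVALID_CHARS.sub("_", title).strip(" .")
    # An empty result (e.g. the title element was missing) gets a placeholder.
    return cleaned or fallback
```

With this in place, the download step in `Browser_access` could open its output file as `open(safe_filename(Access_to_the_song(Print_source)) + '.mp3', 'wb')`.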
https://gitee.com/dushuren/python_reptilian.git