同node01/node02/node03,修改hosts文件到C:\Windows\System32\drivers\etc\hosts
linux平台同理
纯python依赖:
pip install -r requirements.txt
有C依赖,mysqlclient。Linux安装mysql/MariaDB的dev和python版本的dev; windows找到对应版本下载 https://www.lfd.uci.edu/~gohlke/pythonlibs/#mysqlclient
pip install xx.whl
必须cd到py-spark-cdh\mysite
目录下,终端窗口执行./start.sh
swagger接口文档: http://node01:8000/cdh/swagger/
功能首页: http://node01:8000/cdh/
新版功能入口(文件检索): http://node01:8000/cdh/ui#/fileSearch
主要参考自己的笔记
次要参考别人的教程
hadoop3.x在hdfs-site.xml中加入:
<property>gv6
<name>dfs.permissions.enabled</name>
<value>false</value>
</property>
单机的集群,使用python的hdfs模块upload函数不能 n_threads=-1
,因为速度太快,开发用机器会集群异常。可以设为n_threads=2
windows安装spark(一定要spark-3.1.2-bin-hadoop3.2,不要3.2.0版本): https://phoenixnap.com/kb/install-spark-on-windows-10
python manage.py makemigrations cdh
#python manage.py sqlmigrate cdh 0002
python manage.py migrate
如果没有生成新表:
python manage.py createsuperuser
https://3emgk3.axshare.com/#g=1&p=%E6%95%B0%E6%8D%AE%E5%BA%93%E7%95%8C%E9%9D%A2
流程图: https://www.processon.com/diagraming/5eca41171e08530a9b198440
/static/cdh/
node01: 202.118.199.11 277439732 Haichao888
node02: 202.118.199.12 703720614 pq211ju1 haichao123.
node03: 202.118.199.13 893058426 070923
内网windows: 846448280 465632
node01-03: cpu: 40C mem: 256G hdd: 8T ssd: 512G
node01: NVIDIA GeForce RTX 3080
node02: NVIDIA GeForce RTX 3090 NVIDIA GeForce RTX 3090
node03: NVIDIA GeForce RTX 3080 NVIDIA GeForce RTX 3080
数据库版本: 5.5.68-MariaDB 数据库账号密码: root/123 metastore: hive/hive amon: amon/amon scm: scm/scm
数据库账号密码,cdh用 com.cloudera.cmf.db.type=mysql com.cloudera.cmf.db.host=localhost com.cloudera.cmf.db.name=scm com.cloudera.cmf.db.user=scm com.cloudera.cmf.db.setupType=EXTERNAL com.cloudera.cmf.db.password=scm
安装教程参考
CREATE DATABASE scm DEFAULT CHARACTER SET utf8 DEFAULT COLLATE utf8_general_ci; GRANT ALL ON scm.* TO 'scm'@'%' IDENTIFIED BY 'scm'; CREATE DATABASE metastore DEFAULT CHARACTER SET utf8 DEFAULT COLLATE utf8_general_ci; GRANT ALL ON metastore.* TO 'hive'@'%' IDENTIFIED BY 'hive'; CREATE DATABASE amon DEFAULT CHARACTER SET utf8 DEFAULT COLLATE utf8_general_ci; GRANT ALL ON amon.* TO 'amon'@'%' IDENTIFIED BY 'amon'; CREATE DATABASE oozie DEFAULT CHARACTER SET utf8 DEFAULT COLLATE utf8_general_ci; GRANT ALL ON oozie.* TO 'oozie'@'%' IDENTIFIED BY 'oozie'; CREATE DATABASE hue DEFAULT CHARACTER SET utf8 DEFAULT COLLATE utf8_general_ci; GRANT ALL ON hue.* TO 'hue'@'%' IDENTIFIED BY 'hue';
sudo /opt/cloudera/cm/schema/scm_prepare_database.sh mysql scm scm scm sudo /opt/cloudera/cm/schema/scm_prepare_database.sh mysql metastore hive hive sudo /opt/cloudera/cm/schema/scm_prepare_database.sh mysql amon amon amon sudo /opt/cloudera/cm/schema/scm_prepare_database.sh mysql hue hue hue sudo /opt/cloudera/cm/schema/scm_prepare_database.sh mysql oozie oozie oozie
server_host: /etc/cloudera-scm-agent/config.ini
解决agent状态不良
rm -f /var/lib/cloudera-scm-agent/cm_guid systemctl restart cloudera-scm-agent
查看日志
tail -f /var/log/cloudera-scm-agent/cloudera-scm-agent.log
tail -f /var/log/cloudera-scm-server/cloudera-scm-server.log
rm -rf /data/dfs
解析csv,按列头取值,30列内,你们提供列头
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。