1 Star 2 Fork 3

Kunpeng/Spark-ml-algo-lib

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
文件
克隆/下载
贡献代码
同步代码
取消
提示: 由于 Git 不支持空文件夾,创建文件夹后会生成空的 .keep 文件
Loading...
README

kal-test

Description

The Kunpeng algorithm library test tool can be used to test machine learning and graph analysis algorithms.

Compilation Tutorial

Prerequisites

  1. The Maven compilation environment has been configured.
  2. The algorithm software package has been obtained.

Procedure

  1. Go to the Spark-ml-algo-lib/tools/kal-test directory in the compilation environment.
  2. Install the dependencies.
    Take spark 2.3.2 as an example, the install command is as follows:
    mvn install:install-file -DgroupId=org.apache.spark -DartifactId=boostkit-graph-kernel-client_2.11 -Dversion=2.2.0 -Dclassifier=spark2.3.2 -Dfile=boostkit-graph-kernel-client_2.11-2.2.0-spark2.3.2.jar -Dpackaging=jar -DgeneratePom=true
    mvn install:install-file -DgroupId=org.apache.spark -DartifactId=boostkit-ml-kernel-client_2.11 -Dversion=2.2.0 -Dclassifier=spark2.3.2 -Dfile=boostkit-ml-kernel-client_2.11-2.2.0-spark2.3.2.jar -Dpackaging=jar -DgeneratePom=true
  3. Run the compile command:
    mvn clean install
  4. View the kal-test_2.11-0.1.jar file generated in Spark-ml-algo-lib/tools/kal-test/target.

Deployment and Usage Description

  1. Deploy the kal-test folder in the test environment, for example, in the /home/test/boostkit/ directory. If the directory does not exist, create one.
    mkdir -p /home/test/boostkit/
  2. Go to the directory.
    cd /home/test/boostkit/kal-test/
  3. Save the obtained boostkit-graph-kernel-scala_version-kal_version-spark_version-aarch64.jar, boostkit-graph-acc_scala_version-kal_version-spark_version.jar, and boostkit-graph-core_scala_version-kal_version-spark_version.jar files to /home/test/boostkit/kal-test/lib.
  4. Go to the /home/test/boostkit/kal-test directory.
    cd /home/test/boostkit/kal-test
  5. Run the following command in the /home/test/boostkit/kal-test/ directory (taking the PageRank algorithm as an example):
    bash bin/graph/pr_run.sh uk_2002 run no
  6. Check the algorithm running status.

Algorithm and Dataset

Algorithm Dataset Interface
PageRank uk_2002 run

References

  1. Open source KNN: https://github.com/saurfang/spark-knn.git
  2. Open source BFS: https://github.com/prasad223/GraphxBFS
  3. Open source DBSCAN: https://github.com/alitouka/spark_dbscan
  4. Open source ClusteringCoefficient: https://github.com/apache/spark/pull/9150/files
  5. Open source Betweenness: https://github.com/Sotera/distributed-graph-analytics/tree/master/dga-graphx/src/main/scala/com/soteradefense/dga/graphx/hbse
  6. Open source Node2Vec: https://github.com/QuanLab/node2vec-spark
  7. Open source SubgraphMatching: https://datalab.snu.ac.kr/pegasusn/scala-apidoc/#pegasus.spark.subgraph.PSE
  8. Open source XGBoost: https://github.com/dmlc/xgboost/tree/v1.1.0
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
1
https://gitee.com/kunpengcompute/Spark-ml-algo-lib.git
git@gitee.com:kunpengcompute/Spark-ml-algo-lib.git
kunpengcompute
Spark-ml-algo-lib
Spark-ml-algo-lib
master

搜索帮助