2 Star 6 Fork 5

AmCoder / AmCoder

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
Spark通过API连接es.md 1.25 KB
一键复制 编辑 原始数据 按行查看 历史
AmCoder 提交于 2020-06-05 15:35 . 调整目录结构和新增
/**
pom依赖
<dependency>
   	<groupId>org.elasticsearch</groupId>
 	<artifactId>elasticsearch-hadoop</artifactId>
    <version>2.2.0-m1</version>
</dependency>
**/
import data.spark.batch.cardbin.util.CardBinFields;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.sql.SQLContext;
//import ....

public class SparkConnectionEs{
    //spark直连es并通过CardBinFields实体转为sparkRdd从而注册成table
    private static String sourceIP = "192.168.23.23";
    private static String esPath = "ybs_cardbin_info_bak/cardbin";//es_index/es_type
    public static void main(String[] args) throws Exception {
    JavaRDD<CardBinFields> esdataRdd = JavaEsSpark.esRDD(sparkContext, esPath).map(new Function<Tuple2<String, Map<String, Object>>, CardBinFields>() {
			private static final long serialVersionUID = 1L;

			public CardBinFields call(Tuple2<String, Map<String, Object>> v1) throws Exception {
				CardBinFields cardbin = new CardBinFields();
				cardbin.setId(v1._1);
				cardbin.setBank_no(v1._2.get("bank_no").toString());
				return cardbin;
			}
		});
        DataFrame tfcardnoDF = sqlContext.createDataFrame(esdataRdd, CardBinFields.class).select("id", "bank_no");
		tfcardnoDF.registerTempTable("ES_FIELDS");
    }
}
其他
1
https://gitee.com/AmCoder/AmCoder.git
git@gitee.com:AmCoder/AmCoder.git
AmCoder
AmCoder
AmCoder
master

搜索帮助