ElasticSearch安装入门
ElasticSearch介绍:
ElasticSearch是一个基于Lucene的搜索服务器 ,elasticsearch封装了抽象接口用于连接Lucene
Lucene是一套用于全文检索和搜寻的开源程式库
安装很简单:
解压就可以用,几乎是o配置
分布式全文检索服务器
要求是两个节点,最好是三个及以上
安装步骤:
------------------------------------------环境准备-----------------------------------------------
###【在多台机器上执行下面的命令】###
#es启动时需要使用非root用户,所有创建一个bigdata用户:
useradd bigdata
#为hadoop用户添加密码:
echo 123456 | passwd --stdin bigdata
#将bigdata添加到sudoers
echo "bigdata ALL = (root) NOPASSWD:ALL" | tee /etc/sudoers.d/bigdata
chmod 0440 /etc/sudoers.d/bigdata
#解决sudo: sorry, you must have a tty to run sudo问题,在/etc/sudoer注释掉 Default requiretty 一行
sudo sed -i 's/Defaults requiretty/Defaults:bigdata !requiretty/' /etc/sudoers
#创建一个bigdata目录
mkdir /{bigdata,data}
#给相应的目录添加权限
chown -R bigdata:bigdata /{bigdata,data}
------------------------------------------ES安装-----------------------------------------------
We recommend installing the Java 8 update 20 or later, or Java 7 update 55 or later.
Previous versions of Java 7 are known to have bugs that can cause index corruption and data loss.
Elasticsearch will refuse to start if a known-bad version of Java is used.
###【切换到bigdata用户安装】###
1.安装jdk(jdk要求1.8.20或1.7.55以上)
2.上传es安装包
3.解压es
tar -zxvf elasticsearch-2.3.1.tar.gz -C /bigdata/
4.修改配置
vi /bigdata/elasticsearch-2.3.1/config/elasticsearch.yml
#集群名称,通过组播的方式通信,通过名称判断属于哪个集群
cluster.name: bigdata
#节点名称,要唯一
node.name: es-1
#数据存放位置
path.data: /data/es/data
#日志存放位置
path.logs: /data/es/logs
#es绑定的ip地址
network.host: 172.16.0.14
#初始化时可进行选举的节点
discovery.zen.ping.unicast.hosts: ["node-4.itcast.cn", "node-5.itcast.cn", "node-6.itcast.cn"]
5.使用scp拷贝到其他节点
scp -r elasticsearch-2.3.1/ node-5.itcast.cn:$PWD
scp -r elasticsearch-2.3.1/ node-6.itcast.cn:$PWD
6.在其他节点上修改es配置,需要修改的有node.name和network.host
7.启动es(/bigdata/elasticsearch-2.3.1/bin/elasticsearch -h查看帮助文档)
/bigdata/elasticsearch-2.3.1/bin/elasticsearch -d
启动报错: max file descriptors [4096] for elasticsearch process is too low, increase to at least [65535] max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144] 解决办法: 1、编辑 /etc/security/limits.conf,追加以下内容; * soft nofile 65536 * hard nofile 65536 2、在/etc/sysctl.conf文件最后添加一行 vm.max_map_count=262144
8.用浏览器访问es所在机器的9200端口
自带主节点选举机制,最好配置奇数个,防止闹了,选举不依赖zookeeper
可视化管理界面:
#es安装插件下载es插件
/bigdata/elasticsearch-2.3.1/bin/plugin install mobz/elasticsearch-head
[hadoop@master es]$ bin/plugin install mobz/elasticsearch-head
-> Installing mobz/elasticsearch-head...
Trying https://github.com/mobz/elasticsearch-head/archive/master.zip ...
Downloading ....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................DONE
Verifying https://github.com/mobz/elasticsearch-head/archive/master.zip checksums if available ...
NOTE: Unable to verify checksum for downloaded plugin (unable to find .sha1 or .md5 file to verify)
Installed head into /export/servers/es/plugins/head
#本地方式安装head插件
./plugin install file:///home/bigdata/elasticsearch-head-master.zip
#访问head管理页面
elasticSearch简单使用:
RESTful接口URL的格式:
http://localhost:9200/<index>/<type>/[<id>]
其中index、type是必须提供的。
id是可选的,不提供es会自动生成。
index、type将信息进行分层,利于管理。
index可以理解为数据库;type理解为数据表;id相当于数据库表中记录的主键,是唯一的。
#向store索引中添加一些书籍 curl -XPUT 'http://172.16.0.14:9200/store/books/1' -d '{ "title": "Elasticsearch: The Definitive Guide", "name" : { "first" : "Zachary", "last" : "Tong" }, "publish_date":"2015-02-06", "price":"49.99" }' #通过浏览器查询 http://172.16.0.14:9200/store/books/1 #在linux中通过curl的方式查询 curl -XGET 'http://172.16.0.14:9200/store/books/1' #在添加一个书的信息 curl -XPUT 'http://172.16.0.14:9200/store/books/2' -d '{ "title": "Elasticsearch Blueprints", "name" : { "first" : "Vineeth", "last" : "Mohan" }, "publish_date":"2015-06-06", "price":"35.99" }' # 通过ID获得文档信息 curl -XGET 'http://172.16.0.14:9200/bookstore/books/1' #在浏览器中查看 http://172.16.0.14:9200/bookstore/books/1 # 通过_source获取指定的字段 curl -XGET 'http://172.16.0.14:9200/store/books/1?_source=title' curl -XGET 'http://172.16.0.14:9200/store/books/1?_source=title,price' curl -XGET 'http://172.16.0.14:9200/store/books/1?_source' #可以通过覆盖的方式更新 curl -XPUT 'http://172.16.0.14:9200/store/books/1' -d '{ "title": "Elasticsearch: The Definitive Guide", "name" : { "first" : "Zachary", "last" : "Tong" }, "publish_date":"2016-02-06", "price":"99.99" }' # 或者通过 _update API的方式单独更新你想要更新的 curl -XPOST 'http://172.16.0.14:9200/store/books/1/_update' -d '{ "doc": { "price" : 88.88 } }' curl -XGET 'http://172.16.0.14:9200/store/books/1' #删除一个文档 curl -XDELETE 'http://172.16.0.14:9200/store/books/1' # 最简单filter查询 # SELECT * FROM books WHERE price = 35.99 # filtered 查询价格是35.99的 curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{ "query" : { "filtered" : { "query" : { "match_all" : {} }, "filter" : { "term" : { "price" : 35.99 } } } } }' #指定多个值 curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{ "query" : { "filtered" : { "filter" : { "terms" : { "price" : [35.99, 99.99] } } } } }' # SELECT * FROM books WHERE publish_date = "2015-02-06" curl -XGET 'http://172.16.0.14:9200/bookstore/books/_search' -d '{ "query" : { "filtered" : { "filter" : { "term" : { "publish_date" : "2015-02-06" } } } } }' # bool过滤查询,可以做组合过滤查询 # SELECT * FROM books WHERE (price = 35.99 OR price = 99.99) AND (publish_date != "2016-02-06") # 类似的,Elasticsearch也有 and, or, not这样的组合条件的查询方式 # 格式如下: # { # "bool" : { # "must" : [], # "should" : [], # "must_not" : [], # } # } # # must: 条件必须满足,相当于 and # should: 条件可以满足也可以不满足,相当于 or # must_not: 条件不需要满足,相当于 not curl -XGET 'http://172.16.0.14:9200/bookstore/books/_search' -d '{ "query" : { "filtered" : { "filter" : { "bool" : { "should" : [ { "term" : {"price" : 35.99}}, { "term" : {"price" : 99.99}} ], "must_not" : { "term" : {"publish_date" : "2016-02-06"} } } } } } }' # 嵌套查询 # SELECT * FROM books WHERE price = 35.99 OR ( publish_date = "2016-02-06" AND price = 99.99 ) curl -XGET 'http://172.16.0.14:9200/bookstore/books/_search' -d '{ "query" : { "filtered" : { "filter" : { "bool" : { "should" : [ { "term" : {"price" : 35.99}}, { "bool" : { "must" : [ {"term" : {"publish_date" : "2016-02-06"}}, {"term" : {"price" : 99.99}} ] }} ] } } } } }' # range范围过滤 # SELECT * FROM books WHERE price >= 20 AND price < 100 # gt : > 大于 # lt : < 小于 # gte : >= 大于等于 # lte : <= 小于等于 curl -XGET 'http://172.16.0.14:9200/store/books/_search' -d '{ "query" : { "filtered" : { "filter" : { "range" : { "price" : { "gt" : 20.0, "lt" : 100 } } } } } }' # 另外一种 and, or, not查询 # 没有bool, 直接使用and , or , not # 注意: 不带bool的这种查询不能利用缓存 # 查询价格既是35.99,publish_date又为"2015-02-06"的结果 curl -XGET 'http://172.16.0.14:9200/bookstore/books/_search' -d '{ "query": { "filtered": { "filter": { "and": [ { "term": { "price":59.99 } }, { "term": { "publish_date":"2015-02-06" } } ] }, "query": { "match_all": {} } } } }' http://172.16.0.14:9200/bookstore/books/_search #es安装插件下载es插件 /bigdata/elasticsearch-2.3.1/bin/plugin install elasticsearch/marvel/latest #访问head管理页面 http://172.16.0.14:9200/_plugin/marvel