The data I am using is Hadoop log data from a cluster. My pipeline is filebeat --> logstash --> solr: filebeat ships the Hadoop logs to logstash, a logstash config parses them, and the logstash output is then indexed into Solr.
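
For illustration only, here is a minimal Python sketch of one way stray quotes could be stripped from a parsed event before it is indexed. It assumes the unwanted double quotes are literal characters wrapped around the field values, which this thread does not confirm. The names `strip_stray_quotes` and `clean_document` and the sample event are hypothetical, and in the pipeline described here the equivalent step would more likely live in a logstash filter than in Python.

```python
import json


def strip_stray_quotes(value):
    """Remove surrounding whitespace and literal double quotes from a string."""
    if not isinstance(value, str):
        return value  # leave numbers, booleans, etc. untouched
    cleaned = value.strip()
    # Peel off matching pairs of literal quotes, e.g. '"log"' -> 'log'.
    while len(cleaned) >= 2 and cleaned[0] == '"' and cleaned[-1] == '"':
        cleaned = cleaned[1:-1].strip()
    return cleaned


def clean_document(doc):
    """Apply strip_stray_quotes to every top-level field of one event."""
    return {key: strip_stray_quotes(value) for key, value in doc.items()}


# Hypothetical event: one value carries embedded quotes, one trailing space.
sample = {"input": '"log"', "offset": "2061794 ", "version": "7.6.1"}
print(json.dumps(clean_document(sample)))
```

Values that are already clean pass through unchanged, so the function is safe to apply to every field of an event.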
On Mon, 13 Apr 2020 at 19:07, Erick Erickson <erickerick...@gmail.com> wrote:

> I don’t quite know what you’re asking about. Is that input or output to
> Solr? Or is it output from logstash?
>
> What are you indexing? Because that doesn't look like data from a solr log.
>
> You might want to review: https://wiki.apache.org/solr/UsingMailingLists
>
> Best,
> Erick
>
> > On Apr 13, 2020, at 12:24 AM, sefty nindyastuti <seftyni...@gmail.com> wrote:
> >
> > I have a problem when indexing log data clusters in solr using logstash
> > and filebeat. there are double quotes in the solr index results,
> > how to solve this problem, please help
> >
> > expect the results of the index that appears in solr as below:
> >
> > {
> > "input": "log",
> > "hostname": "localhost",
> > "id": "22eddbc9-e60f-29cd-a352-b40154ba1736",
> > "type": "filebeat",
> > "ephemeral_id": "1a31d6e0-8ed9-1307-215f-5dfd361364c9",
> > "version": "7.6.1",
> > "offset": "2061794 ",
> > "path": " /var/log/hadoop/hdfs/hadoop-hdfs-secondarynamenode-xxxxxx.log ",
> > "host": "localhostxxx",
> > "message": "2020-04-11 19: 04: 28,575 INFO common.Util (Util.java:receiveFile(314)) - Combined time for file downloads and fsync to all disks stores 0.02s. The file download stores 0.02s at 58750.00 KB / s Synchronous (fsync) write to disk of / hadoop / hdfs / namesecondary / current / edits_tmp_xxxxxxxx "
> > }