hadoop - HIVE: How to Parse / Ignore SLF4J Header in JSON Log Message -
i working on hive demo, , want perform , aggregation query on file contains json messages, prefaced log4j style message @ beginning of each log line:
20:49:07.962 [main] info com.example.application - {"docid":"abc","user":{"id":1236,"username":"larry1234","name":"larry","shippingaddress":{"address1":"789 main st.","address2":"","city":"durham","state":"nc","postalcode":"27713"},"orders":[{"itemid":1111,"orderdate":"11/11/2012"},{"itemid":2222,"orderdate":"12/12/2012"}]}} i have large volume of such records, , working on hive demo. aware of hive-json-serde. how tell hive ignore log4j preamble?
you can use get_json_object function extract json data record , build on. blog can of help. http://thornydev.blogspot.in/2013/07/querying-json-records-via-hive.html
Comments
Post a Comment