๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

๐Ÿ’ปTech/๐ŸHIVE17

[Hive] beeline ๊ณ„์ •,ํŒจ์Šค์›Œ๋“œ ์—†์ด ์ž๋™ ๋กœ๊ทธ์ธ ์„ค์ • hive 3.0 ๊ฐ™์€ ๊ฒฝ์šฐ ๋ชจ๋“  hive ์ ‘์†์„ beeline์œผ๋กœ ์ˆ˜ํ–‰ํ•ด์•ผํ•˜๋Š”๋ฐ ๋งค๋ฒˆ ๊ณ„์ •,ํŒจ์Šค์›Œ๋“œ ์ž…๋ ฅํ•˜๋Š” ๋ถˆํŽธํ•จ๊ณผ ์‰˜ ์ž‘์„ฑํ•  ๊ฒฝ์šฐ ๋ณด์•ˆ๋ฌธ์ œ๊ฐ€ ์žˆ๋Š”๋ฐ ์•„๋ž˜ xml ํŒŒ์ผ์„ ~/.beeline/ ๋ฐ‘์— ์ƒ์„ฑํ•ด์ฃผ๋ฉด ์ž๋™ ๋กœ๊ทธ์ธ์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. ๐Ÿ’ก LDAP ์ž๋™ ์ธ์ฆ ๋ฐฉ๋ฒ• vi /home/admin/.beeline/beeline-hs2-connection.xml --------------------------------------------------------------- beeline.hs2.connection.user hive beeline.hs2.connection.password hive_password --------------------------------------------------------.. 2019. 9. 23.
[Hive] parquet ์••์ถ• ์„ค์ • CREATE TABLE test(a int, b string) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' STORED AS PARQUET TBLPROPERTIES ("parquet.compression"="SNAPPY"); 2019. 5. 29.
[Hive] ์ฟผ๋ฆฌ๊ฒฐ๊ณผ ํŒŒ์ผ ์ถ”์ถœ โ—พ hive -e ๋ช…๋ น์–ด ์‚ฌ์šฉํ•˜์—ฌ ํŒŒ์ผ ์ €์žฅํ•˜๋Š” ๋ฐฉ๋ฒ• - beeline์œผ๋กœ select ๊ฒฐ๊ณผ๋ฅผ ์ถœ๋ ฅํ•  ๊ฒฝ์šฐ ์ปฌ๋Ÿผ๋ช…๊ณผ ํ…Œ์ด๋ธ” ํ˜•ํƒœ๋„ ๊ฐ™์ด ๋‚˜์˜ค๊ธฐ ๋•Œ๋ฌธ์—, outputformat=tsv2 ์˜ต์…˜์œผ๋กœ ์ œ๊ฑฐํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. hive --outputformat=tsv2 -e "select * from db.tb" > /local_path/test.dat โ—พ delimiter ๋ณ€๊ฒฝํ•˜์—ฌ ํŒŒ์ผ ์ €์žฅํ•˜๋Š” ๋ฐฉ๋ฒ• (hive to file) - ์•„๋ž˜๋‚ด์šฉ์—์„œ ์ €์žฅํ•  ํŒŒ์ผ์˜ ๋””๋ ‰ํ† ๋ฆฌ ์œ„์น˜์™€ ์›ํ•˜๋Š” ๊ตฌ๋ถ„์ž, ์ฟผ๋ฆฌ ์„ค์ • ํ›„ hql ํŒŒ์ผ ์ €์žฅ INSERT OVERWRITE LOCAL DIRECTORY './directory_name' ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS .. 2019. 5. 29.
[Hive] header ์ œ๊ฑฐ ์˜ต์…˜ hive์—์„œ ๋ฐ์ดํ„ฐ ์กฐํšŒ ์‹œ ํŒŒ์ผ์— ํ—ค๋”๊ฐ€ ์žˆ๋Š” ๊ฒฝ์šฐ ํŒŒ์ผ์—์„œ ์ง์ ‘ ํ—ค๋”๋ฅผ ์ œ๊ฑฐํ•˜์ง€ ์•Š๊ณ  ์•„๋ž˜ ์˜ต์…˜์œผ๋กœ ๋Œ€์ฒด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. ์Šคํ‚ค๋งˆ ๋งˆ์ง€๋ง‰ ์ค„์— ์•„๋ž˜ ์˜ต์…˜ ์ถ”๊ฐ€ ํ•˜๊ฑฐ๋‚˜ ALTER TABLE ๋ช…๋ น์–ด๋กœ ์ ์šฉํ•ด ์ค๋‹ˆ๋‹ค. โ—พ ํ—ค๋” ์ œ๊ฑฐ ์˜ต์…˜ ์„ค์ • 1๋ฒˆ์งธ ๋ผ์ธ ์ œ๊ฑฐํ•˜๊ณ  ๋ฐ์ดํ„ฐ ์กฐํšŒ ๋ฉ๋‹ˆ๋‹ค. ALTER TABLE db_nm.tb_nm set tblproperties("skip.header.line.count"="1"); โ—พ ์ถ”๊ฐ€ํ•œ ์˜ต์…˜ ์ œ๊ฑฐ ALTER TABLE db_nm.tb_nm UNSET TBLPROPERTIES('skip.header.line.count'); 2019. 5. 16.
Hive ๋ช…๋ น์–ด ์ •๋ฆฌ โ—พ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ƒ์„ฑ CREATE DATABASE IF NOT EXISTS db_nm; โ—พ ํ…Œ์ด๋ธ” ์ƒ์„ฑ External : hive์—์„œ ํ…Œ์ด๋ธ” dropํ•  ๊ฒฝ์šฐ hdfs ๊ฒฝ๋กœ ๋ฐ ํŒŒ์ผ ๋ณด์กด HDFS ๊ฒฝ๋กœ์˜ ๋ฐ์ดํ„ฐ ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ Amazon, Azure ๋“ฑ์˜ ํด๋ผ์šฐ๋“œ ์Šคํ† ๋ฆฌ์ง€๋กœ ์ง€์ • ๊ฐ€๋Šฅ Managed : hive์—์„œ ํ…Œ์ด๋ธ” dropํ•  ๊ฒฝ์šฐ hdfs ๊ฒฝ๋กœ ๋ฐ ํŒŒ์ผ ์‚ญ์ œ -- External Table CREATE EXTERNAL TABLE IF NOT EXISTS db_nm.table_nm ( a string comment 'a', b string comment 'b', c string comment 'c' ) comment 'table comment' PARTITIONED BY (DT STRING) ROW .. 2017. 2. 15.
[HIVE] Delimiter \u001E \u001E -> \036 [record seperator] ROW FORMAT DELIMITED FIELDS TERMINATED BY '\036' ์œผ๋กœ ์„ค์ • ํ›„ Desc ์กฐํšŒํ•˜๋ฉด \u001E ํ—ฅ์‚ฌ๊ฐ’์œผ๋กœ ๋‚˜์˜ต๋‹ˆ๋‹ค. ์•„๋ž˜ site ์•„์Šคํ‚คํ‘œ ์ฐธ๊ณ ํ•˜๋ฉด hive ๋”œ๋ฆฌ๋ฏธํ„ฐ ๊ด€๋ จํ•˜์—ฌ ์ดํ•ดํ•˜๊ธฐ ์‰ฝ์Šต๋‹ˆ๋‹ค. 2016. 11. 30.
[Hive] ํ…Œ์ด๋ธ” ์‚ญ์ œํ•˜์ง€ ์•Š๊ณ  drop database ํ•˜๋Š” ๋ฐฉ๋ฒ• temp ๋ฐ์ดํ„ฐ ๋ฒ ์ด์Šค๋กœ ํ…Œ์ด๋ธ” ๋ช…์„ ๋ณ€๊ฒฝํ•˜๊ณ  ์‚ญ์ œํ•ด์ฃผ๋ฉด ๋ฉ๋‹ˆ๋‹ค. CREATE DATABASE temp; USE targetDB; ALTER TABLE targetTable RENAME TO temp.targetTable ; DROP DATABASE targetDB; 2016. 10. 18.
[Hive] ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์Šคํ‚ค๋งˆ ์ „์ฒด ์ถ”์ถœ โ—พ hive์—์„œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์Šคํ‚ค๋งˆ๋ฅผ ํ†ต์งธ๋กœ(ํ•˜์œ„ ํ…Œ์ด๋ธ”๋“ค๊นŒ์ง€) ์ƒ์„ฑํ•˜๋Š” shell script #!/bin/bash OPTION="--showHeader=false --outputformat=tsv2" DATABASE_NM=`hive ${OPTION} -e "show databases;"` for database in $DATABASE_NM do TABLE_NM=`hive ${OPTION} -e "use ${database}; show tables;"` for table in $TABLE_NM do hive ${OPTION} -e "show create table ${database}.${table}" >> ${database}_schema.hql echo ";" >> ${database}_schema.. 2016. 10. 4.