๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

๐Ÿ’ปTech159

[NiFi] content ๋‚ด์šฉ์„ attribute๋กœ ์ €์žฅํ•˜๋Š” ๋ฐฉ๋ฒ• NiFi Processor ์ค‘ ExtractText ํ”„๋กœ์„ธ์„œ์—์„œ ์ •๊ทœ์‹์„ ์‚ฌ์šฉํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค. html ๋‚ด์šฉ์„ attribute์— ์ €์žฅํ•ด์„œ sql์— ์ง‘์–ด๋„ฃ๊ธฐ ์œ„ํ•ด ์ถ”์ถœํ–ˆ์Šต๋‹ˆ๋‹ค. ๐Ÿ‘จ‍๐Ÿ’ป ExtractText์—์„œ ๋ณ€๊ฒฝํ•œ properties ๋‚ด์šฉ Maximum Capture Group Length = 1048576 Enable DOTALL Mode = true Enable Multiline Mode = true body = (.*) 2022. 10. 20.
[NiFi] ORACLE, Impala Data Type ๋น„๊ต NiFI์—์„œ ORACLE ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ํ•  ๋•Œ ์ •๋ฆฌํ•œ ๋‚ด์šฉ์ž…๋‹ˆ๋‹ค. ๐Ÿ“ Data Type ORACLE Impala CLOB string varchar string number(int) decimal(20,0) number(double) decimal(20,3) timestamp timestamp date timestamp binary_double double โ—พ ExecuteSQLRecord Use Avro Logical Types: true Max Rows Per Flow File: 500000 Fetch Size: 500000 โ—พ Impala DBCP Validation query: select 1 2022. 9. 22.
[NiFi] Groovy-Java๋กœ ์—ฌ๋Ÿฌ ๋‚ ์งœ ๋ฝ‘๋Š” ๋ฐฉ๋ฒ• NiFi์—์„œ ์ œ๊ณตํ•˜๋Š” Groovy Script๋Š” ์ž๋ฐ” ํ˜ธํ™˜์ด ๊ฑฐ์˜ ๋˜๋ฏ€๋กœ ์ž๋ฐ” ์ฝ”๋“œ๋กœ ์ž‘์„ฑํ•˜์—ฌ๋„ ์ž˜ ๋Œ์•„๊ฐ Create ExecuteScript Processor PROPERTIES > Select Groovy > Write Script Body Script Contents import java.text.DateFormat; import java.text.ParseException; import java.text.SimpleDateFormat; import java.util.Calendar; import java.util.Date; //attribute์˜ ๋‚ ์งœ(yyyymmdd) ๊ธฐ์ค€๋ถ€ํ„ฐ -30์ผ๊นŒ์ง€ ์ถ”์ถœ flowFile = session.get(); if(!flowFile) return; dt = flo.. 2022. 9. 22.
zeppelin interpreter resource share mode (notebook pending) zeppelin ์—์„œ ์—ฌ๋Ÿฌ ํƒœ์Šคํฌ๋ฅผ ์‹คํ–‰ํ•˜๋ฉด, ์„ ํ–‰ ์ž‘์—…์ด ๋๋‚ ๋•Œ๊นŒ์ง€ ์ž‘์—…์ด pending ๊ฑธ๋ฆฌ๋Š” ๊ฒฝ์šฐ๊ฐ€ ์žˆ๋„ค์š” ๐Ÿ”— ๋งํฌ ์ฐธ๊ณ  https://zeppelin.apache.org/docs/0.8.0/usage/interpreter/interpreter_binding_mode.html 2022. 9. 19.
[NiFi] global variable(์ „์—ญ๋ณ€์ˆ˜) ์„ค์ • nifi์—์„œ global variable(์ „์—ญ๋ณ€์ˆ˜) ์„ค์ •ํ•˜์—ฌ ์—ฌ๋Ÿฌ Process Group์—์„œ ํ˜„์žฌ ๋‚ ์งœ(yyyyMMdd)๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์š”๊ฑด์ด ์žˆ์—ˆ๋Š”๋ฐ ๋ฐฉ๋ฒ• ์ฐพ๋Š๋ผ ์—„์ฒญ ๊ณ ์ƒํ–ˆ๋„ค์š” ๐Ÿ˜ญ Variables์™€ Parameter 2๊ฐ€์ง€ ๊ธฐ๋Šฅ์„ ์ œ๊ณตํ•˜๋Š”๋ฐ Parameter์—์„œ NiFi EL(Expression Language)์ด ๋จนํ˜€์„œ ๊ธ€๋กœ๋ฒŒ ๋‚ ์งœ ๋ณ€์ˆ˜๋ฅผ ์ถ”์ถœํ•  ์ˆ˜ ์žˆ์—ˆ์Šต๋‹ˆ๋‹ค. Variables์—์„œ๋Š” ${now():format('yyyyMMdd')} ์™€ ๊ฐ™์€ ํ‘œํ˜„์‹์„ ๋ชจ๋‘ String ๋ฌธ์ž ๊ฐ’์œผ๋กœ ์ฝ์–ด์„œ ๋ฐ˜ํ™˜ํ•ด์„œ ์‚ฌ์šฉํ•  ์ˆ˜ ์—†๊ณ , Parameter๋Š” EL ํ‘œํ˜„์‹์ด ์‚ฌ์šฉ ๋ถˆ๊ฐ€ํ•˜๋‹ค๊ณ  NiFi doc์— ๋‚˜์™€์žˆ์ง€๋งŒ ์‚ฌ์šฉ์ด ๋˜๋„ค์š”..๐Ÿคจ 1. Parameter์—์„œ EL ์‚ฌ์šฉ ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. 2. ํ”„๋กœ์„ธ์Šค ๊ทธ๋ฃน์— ํŒŒ๋ผ๋ฏธํ„ฐ ๋งคํ•‘ 3.. 2022. 8. 31.
[zeppelin] python interpreter ์„ค์น˜ ๋ฐ ์—ฐ๋™ ํด๋ผ์šฐ๋ฐ๋ผ์—์„œ zeppelin ์„œ๋น„์Šค์— python interpreter ์„ค์น˜ ๋ฐ ์—ฐ๋™ํ•˜๋Š” ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. clouder doc์—๋Š” ๋‚˜์™€์žˆ์ง€ ์•Š์•„ ๊ณผ๊ฑฐ HDP์™€ Apache Zeppelin ๋ฌธ์„œ ์ฐธ๊ณ  ํ•˜์˜€์Šต๋‹ˆ๋‹ค ๐Ÿ™‚ 1. zeppelin ์„ค์น˜๋œ ์„œ๋ฒ„ ์ ‘์† ํ›„ ํŒŒ์ด์ฌ ์ธํ„ฐํ”„๋ฆฌํ„ฐ ์„ค์น˜ ์„ค์น˜ ์™„๋ฃŒ๋˜๋ฉด /opt/cloudera/parcels/CDH/lib/zeppelin/interpreter/python ๊ฒฝ๋กœ๊ฐ€ ์ƒ์„ฑ๋จ ํ•˜์œ„ ๊ฒฝ๋กœ ๊ถŒํ•œ ํ™•์ธ (chmod 644) /opt/cloudera/parcels/CDH/lib/zeppelin/bin/install-interpreter.sh -n python 2. zeppelin web-ui > interpreter > create python interpreter๊ฐ€ ์ •์ƒ ์„ค์น˜๋˜๋ฉด.. 2022. 8. 18.
[zeppelin] Authentication failed for PAM. ๐Ÿšซ ERROR Exception in login: org.apache.shiro.authc.AuthenticationException: Authentication failed for PAM. Caused by: org.jvnet.libpam.PAMException: pam_authenticate failed : Authentication failure ๐Ÿ’ก SOLVED ## check shiro.ini: -------------------------------------------- pamRealm=org.apache.zeppelin.realm.PamRealm pamRealm.service=sshd -------------------------------------------- ## set acl $ se.. 2022. 8. 16.
[๋ฆฌ๋ˆ…์Šค] ์—ฌ๋Ÿฌ jar ํŒŒ์ผ์•ˆ์— class ๋ชฉ๋ก ํ•œ๋ฒˆ์— ์ถœ๋ ฅ ll | grep log4j | awk '{print $9}' | xargs -d '\n' -n 1 jar -tvf find . -name "*.jar" -exec echo ==\ {} \; -exec jar tf {} \;|grep -E "==|HiveMetaStore" 2022. 8. 4.