Computer Science/Data Science

[Machine Learning] ์•™์ƒ๋ธ” ๊ธฐ๋ฒ•์ด๋ž€?

_cactus 2022. 6. 20. 20:36
๋ฐ˜์‘ํ˜•

Ensemble ๊ธฐ๋ฒ•

Ensemble Learning์ด๋ž€

  • ์—ฌ๋Ÿฌ๊ฐœ์˜ ๋ถ„๋ฅ˜๊ธฐ๋ฅผ ์ƒ์„ฑํ•˜๊ณ  ๊ทธ ์˜ˆ์ธก์„ ๊ฒฐํ•ฉํ•˜์—ฌ ๋ณด๋‹ค ์ •ํ™•ํ•œ ์˜ˆ์ธก์„ ๋‚ด๋Š” ๊ธฐ๋ฒ•
  • ๊ฐ•๋ ฅํ•œ ํ•˜๋‚˜์˜ ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜๋Š” ๋Œ€์‹  ๋ณด๋‹ค ์•ฝํ•œ ๋ชจ๋ธ์„ ์—ฌ๋Ÿฌ๊ฐœ ์กฐํ•ฉํ•˜๋Š” ๋ฐฉ์‹

 

Ensemble Learning ์ข…๋ฅ˜

์•™์ƒ๋ธ” ํ•™์Šต์€ 3๊ฐ€์ง€ ์œ ํ˜•์œผ๋กœ ๋ถ„๋ฅ˜๋จ

  1. Voting
  2. Bagging
  3. Boosting

 

Voting

  • ์—ฌ๋Ÿฌ๊ฐœ์˜ classifier๊ฐ€ ํˆฌํ‘œ๋ฅผ ํ†ตํ•ด ์ตœ์ข… ์˜ˆ์ธก๊ฒฐ๊ณผ ๊ฒฐ์ •
  • ์„œ๋กœ ๋‹ค๋ฅธ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์—ฌ๋Ÿฌ๊ฐœ ๊ฒฐํ•ฉํ•˜์—ฌ ์‚ฌ์šฉ
  • Voting ๋ฐฉ์‹
    • Hard Voting : ๋‹ค์ˆ˜์˜ classifier๊ฐ€ ์˜ˆ์ธกํ•œ ๊ฒฐ๊ณผ๊ฐ’์„ ์ตœ์ข… ๊ฒฐ๊ณผ๋กœ ์„ ์ • (๋‹ค์ˆ˜๊ฒฐ์˜ ๋ฒ•์น™)
    • Soft Voting : ๋ชจ๋“  classifier๊ฐ€ ์˜ˆ์ธกํ•œ label๊ฐ’์˜ ๊ฒฐ์ • ํ™•๋ฅ  ํ‰๊ท ์„ ๊ตฌํ•œ ๋’ค ๊ฐ€์žฅ ํ™•๋ฅ ์ด ๋†’์€ label๊ฐ’์„ ์ตœ์ข…๊ฒฐ๊ณผ๋กœ ์„ ์ •

 

Bagging

: Bootstrap Aggregating

  • Bootstrap(๋ฐ์ดํ„ฐ ์ƒ˜ํ”Œ๋ง)์„ ํ†ตํ•ด ๋ชจ๋ธ ํ•™์Šต, ๊ฒฐ๊ณผ ์ง‘๊ณ„(Aggregate)ํ•˜๋Š” ๋ฐฉ๋ฒ•
    • Bootstrap? 
      : ๋ฐ์ดํ„ฐ ๋‚ด์—์„œ ๋ฐ˜๋ณต์ ์œผ๋กœ ์ƒ˜ํ”Œ์„ ์‚ฌ์šฉํ•˜๋Š” resampling ๊ธฐ๋ฒ•
  • ๋ชจ๋‘ ๊ฐ™์€ ์œ ํ˜•์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ๊ธฐ๋ฐ˜์˜ classifier ์‚ฌ์šฉ
  • ๋ฐ์ดํ„ฐ ๋ถ„ํ•  ์‹œ ์ค‘๋ณตํ—ˆ์šฉ (Bootstrap)
  • Aggregate์ง‘๊ณ„ ๋ฐฉ์‹
    • ์ด์‚ฐํ˜• ๋ฐ์ดํ„ฐ : ๋‹ค์ˆ˜๊ฒฐ ํˆฌํ‘œ๋ฐฉ์‹์œผ๋กœ ๊ฒฐ๊ณผ ์ง‘๊ณ„
    • ์—ฐ์†ํ˜• ๋ฐ์ดํ„ฐ : ํ‰๊ท ๊ฐ’ ์ง‘๊ณ„
  • ๊ณผ์ ํ•ฉ ๋ฐฉ์ง€์— ํšจ๊ณผ์ 
  • ๋Œ€ํ‘œ ์•Œ๊ณ ๋ฆฌ์ฆ˜ : Random Forest
๐Ÿ‘‡ Random Forest ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์„ค๋ช…์ด ๊ถ๊ธˆํ•˜๋‹ค๋ฉด..!๐Ÿ‘‡

2021.03.08 - [Computer Science/Data Science] - Random Forest ๊ฐ„.๋‹จ.๋ช….๋ฃŒ

 

Random Forest ๊ฐ„.๋‹จ.๋ช….๋ฃŒ

Ensemble ์•™์ƒ๋ธ” ์—ฌ๋Ÿฌ ๊ฐœ์˜ ๋จธ์‹ ๋Ÿฌ๋‹ model์„ ์—ฐ๊ฒฐํ•˜์—ฌ ๊ฐ•๋ ฅํ•œ model์„ ๋งŒ๋“œ๋Š” ๊ธฐ๋ฒ• classifier/regression์— ์ „๋ถ€ ํšจ๊ณผ์  random forest์™€ gradient boosting์€ ๋‘˜๋‹ค model์„ ๊ตฌ์„ฑํ•˜๋Š” ๊ธฐ๋ณธ ์š”์†Œ๋กœ decision tree..

mac-user-guide.tistory.com

 

 

Boosting

  • ์—ฌ๋Ÿฌ๊ฐœ์˜ classifier๊ฐ€ ์ˆœ์ฐจ์ ์œผ๋กœ ํ•™์Šต ์ˆ˜ํ–‰
  • ์ด์ „ ๋ถ„๋ฅ˜๊ธฐ๊ฐ€ ์˜ˆ์ธก์„ ํ‹€๋ฆฐ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•ด์„œ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ๋„๋ก ๋‹ค์Œ ๋ถ„๋ฅ˜๊ธฐ์—๊ฒŒ ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์—ฌํ•˜๋ฉด์„œ ํ•™์Šต&์˜ˆ์ธก ์ง„ํ–‰
  • ์ด๋Ÿฐ์‹์œผ๋กœ ๊ณ„์†ํ•ด์„œ ๋ถ„๋ฅ˜๊ธฐ์— ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์ŠคํŒ…ํ•˜๋ฉฐ ํ•™์Šต์„ ์ง„ํ–‰ํ•˜๋Š” ๋ฐฉ์‹์„ “๋ถ€์ŠคํŒ… ๋ฐฉ์‹"์ด๋ผ๊ณ  ํ•จ
  • ๋ณดํ†ต ๋ถ€์ŠคํŒ… ๋ฐฉ์‹์€ ๋ฐฐ๊น…์— ๋น„ํ•ด ์„ฑ๋Šฅ์ด ์ข‹์ง€๋งŒ, ์†๋„๊ฐ€ ๋Š๋ฆฌ๊ณ  ๊ณผ์ ํ•ฉ ๋ฐœ์ƒ ๊ฐ€๋Šฅ์„ฑ ์กด์žฌ
  • ๋Œ€ํ‘œ ์•Œ๊ณ ๋ฆฌ์ฆ˜ : XGBoost, LightGBM

 

๐Ÿ‘‡ LightGBM ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์„ค๋ช…์ด ๊ถ๊ธˆํ•˜๋‹ค๋ฉด..!๐Ÿ‘‡

2021.05.20 - [Computer Science/Data Science] - [Machine Learning] LightGBM์ด๋ž€? โœ” ์„ค๋ช… ๋ฐ ์žฅ๋‹จ์ 

 

[Machine Learning] LightGBM์ด๋ž€? โœ” ์„ค๋ช… ๋ฐ ์žฅ๋‹จ์ 

๐Ÿ“Œ Remind LightGBM์— ๋“ค์–ด๊ฐ€๊ธฐ์ „์— ๋ณต์Šต ๊ฒธ reminding์„ ํ•ด๋ณด์ž. Light GBM์˜ GBM์€ Gradient Boosting Model๋กœ, tree๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ํ•™์Šต ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‹ค. ์ด GBM์˜ ํ•™์Šต๋ฐฉ์‹์„ ์‰ฝ๊ฒŒ๋งํ•˜๋ฉด, ํ‹€๋ฆฐ๋ถ€๋ถ„์— ๊ฐ€์ค‘์น˜๋ฅผ..

mac-user-guide.tistory.com

 

 

 

 

 

 

 

reference: http://www.dinnopartners.com/__trashed-4/

728x90
๋ฐ˜์‘ํ˜•