728x90

machine learning 2

[Machine Learning] SMOTETomek

- SMOTETomek? Combination of over - and under - sampling method SMOTE์˜ ๋ฐฉ๋ฒ•๊ณผ TomekLink๋ฅผ ๋ณตํ•ฉํ•˜์—ฌ ์ง„ํ–‰ํ•˜๋Š” ๊ฒƒ SMOTE๋กœ over sampling ์ง„ํ–‰ ํ›„ ๊ฒฝ๊ณ„์„ ์— ์žˆ๋Š” major sample์„ ์ œ๊ฑฐ ๋ถ„๋ฅ˜ ๊ฒฝ๊ณ„๋ฉด์„ ๋šœ๋ ทํ•˜๊ฒŒํ•˜์—ฌ ๋ถ„๋ฅ˜๊ฐ€ ์ž˜ ๋  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค. - Import import pandas as pd import matplotlib.pyplot as plt import seaborn as sns from sklearn.datasets import make_classification from imblearn.under_sampling import TomekLinks from imblearn.combine import SMOTETom..

Machine Learning 2022.11.23

[Machine Learning]PCA๋กœ cluster ๊ทธ๋ž˜ํ”„ ๊ทธ๋ฆฌ๊ธฐ

๋‹ค์ฐจ์›์˜ ๋ณ€์ˆ˜๋ฅผ 2์ฐจ์›์˜ ๊ทธ๋ž˜ํ”„๋กœ ๋‚˜ํƒ€๋‚ด๊ธฐ ์œ„ํ•ด์„œ๋Š” PCA๋ฅผ ์‚ฌ์šฉํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ์˜ค๋Š˜์€ PCA๋กœ ๋ณ€์ˆ˜๋ฅผ ์••์ถ• ํ›„ target์ด ์ž˜ ๋ถ„๋ฆฌ๋˜์–ด ์žˆ๋Š”์ง€ ๊ทธ๋ž˜ํ”„๋กœ ๋‚˜ํƒ€๋‚ด๊ฒ ์Šต๋‹ˆ๋‹ค. - Data Load ๋ฐ Import iris data๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ง„ํ–‰ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค. import pandas as pd import matplotlib.pyplot as plt import seaborn as sns from sklearn.datasets import load_iris from sklearn.decomposition import PCA data = load_iris() ์ด data๋Š” dictionary ํ˜•ํƒœ๋กœ ๋˜์–ด ์žˆ์œผ๋ฉฐ, keys๋ฅผ ํ†ตํ•ด ์–ด๋–ค ๋ฐ์ดํ„ฐ ํ•ญ๋ชฉ์ด ์žˆ๋Š”์ง€ ์•Œ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. data.keys() >>> dict_k..

Machine Learning 2022.11.11
728x90