Student Cao
2023-04-16 10:55 Could you explain what data mining is? I didn't follow it. Ideally with a definition in English.
Category: CFA Level III > Equity Portfolio Management
1 answer
Teaching Assistant Kaikai
2023-04-17 11:24
This answer was accepted by the asker
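Hello! Data mining means searching a dataset over and over until a significant pattern shows up. The data has no genuine correlation or particular structure to begin with, but if you keep resampling or searching, some slice of it will eventually display a pattern purely by chance. That is data-mining bias: the pattern has no economic rationale and no logical basis, and it appears only because the dataset was searched excessively.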
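To make the "search long enough and something will look significant" point concrete, here is a minimal simulation sketch. It is not from the curriculum; the function names, the 200 candidate signals, the 60 observations, and the |t| > 2.0 cutoff (roughly a 5% two-sided test) are all assumptions chosen for illustration. Every signal is pure noise, so any "significant" correlation with returns is spurious by construction.

```python
import math
import random


def correlation(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def count_spurious_signals(n_candidates=200, n_obs=60, t_critical=2.0, seed=0):
    """Count random signals whose correlation with random returns passes a t-test.

    All names and defaults are hypothetical. Both the returns and every
    candidate signal are independent noise, so there is no true relationship.
    """
    rng = random.Random(seed)
    returns = [rng.gauss(0, 1) for _ in range(n_obs)]
    hits = 0
    for _ in range(n_candidates):
        signal = [rng.gauss(0, 1) for _ in range(n_obs)]
        r = correlation(signal, returns)
        # t-statistic for a sample correlation, with n_obs - 2 degrees of freedom
        t_stat = r * math.sqrt((n_obs - 2) / (1 - r * r))
        if abs(t_stat) > t_critical:
            hits += 1
    return hits


if __name__ == "__main__":
    hits = count_spurious_signals()
    print(f"{hits} of 200 purely random signals look 'significant'")
```

With a 5% test, about 5% of the noise signals should pass on average, so an analyst who tries enough variables will almost always find a few that "work" in-sample even though none has predictive value.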
The original curriculum text defines it as:
Data-mining bias arises from repeatedly searching a dataset until a statistically significant pattern emerges. It is almost inevitable that some relationship will appear. Such patterns cannot be expected to have predictive value. Lack of an explicit economic rationale for a variable’s usefulness is a warning sign of a data-mining problem: no story, no future. Of course, the analyst must be wary of inventing the story after discovering
A like would be appreciated~ Keep it up, and best of luck passing the exam~
