A common need, nicely and simply put here in KDNuggets. Very classic and should be done with most every set of data you are seriously working with. With more tech and coding at the link:
Four Techniques for Outlier Detection
Tags: DBSCAN, Knime, Outliers, Python
There are many techniques to detect and optionally remove outliers from a dataset. In this blog post, we show an implementation in KNIME Analytics Platform of four of the most frequently used - traditional and novel - techniques for outlier detection. By Maarit Widmann, Moritz Heine, Rosaria Silipo, Data Scientists at KNIME
Anomalies, or outliers, can be a serious issue when training machine learning algorithms or applying statistical techniques. They are often the result of errors in measurements or exceptional system conditions and therefore do not describe the common functioning of the underlying system. Indeed, the best practice is to implement an outlier removal phase before proceeding with further analysis.
But hold on there! In some cases, outliers can give us information about localized anomalies in the whole system; so the detection of outliers is a valuable process because of the additional information they can provide about your dataset.
There are many techniques to detect and optionally remove outliers from a dataset. In this blog post, we show an implementation in KNIME Analytics Platform of four of the most frequently used - traditional and novel - techniques for outlier detection. .... "
No comments:
Post a Comment