Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

What is the best distance measure to be used while clustering?

April 26, 2017best clustering distance measure Used

0

10 Posted

What is the best distance measure to be used while clustering?

1 Answer

0

10 Posted

A. The choice of the distance measure depends on the area of application and the sort of similarities one would like to detect. For example, if the gene expression measurements for all samples in one gene are three times the expression measurements in the other gene, those two genes would be considered distant using Euclidean distance metric, but close using correlation coefficient (because correlation coefficient considers only change pattern). Manhattan distance is more robust against outliers. Euclidean distance is the preferred one to successfully group similar data items.