Jump to content

Help with homework


Recommended Posts

Posted
Just now, ring_master said:

K- means vaadu but K-means assigns cluster to it's members based on euclidean distance between clusters

tSNE is what I used for clustering in the earlier problem based on Euclidian distance. 

Posted

First use gridsearch and randomsearchcv algorithms to get the best hyper parameters

Then try K-means(n=1,2,3,....) and SVM(best model) for the clustering(playing with c,gamma values)

Hierarchial clustering before k means will also give you an estimate on optimal no of clusters

Posted
Just now, kathanayaka said:

First use gridsearch and randomsearchcv algorithms to get the best hyper parameters

Then try KNN(n=1,2,3,....) and SVM(best model) for the clustering(playing with c,gamma values)

Does that model cluster based on Euclidean distance between the clusters of the probability distribution of the cluster? 

Posted
2 minutes ago, Meowmeow said:

Does that model cluster based on Euclidean distance between the clusters of the probability distribution of the cluster? 

K means supports only Euclidean distance
 

Posted
3 minutes ago, Meowmeow said:

tSNE is what I used for clustering in the earlier problem based on Euclidian distance. 

Clustering will be based on similiarity measure... so distance is used as similarity measure . You don;t want to use distance to measure similarity? Is that you're trying to do?

Posted
2 minutes ago, kathanayaka said:

K means supports only Euclidean distance
 

Yeah I used tSNE for Euclidian distance. I need an algorithm for probability distribution of the cluster. 

Posted
Just now, Meowmeow said:

Yeah I used tSNE for Euclidian distance. I need an algorithm for probability distribution of the cluster. 

why did even tSNE come into picture

Its just for data exploration and visualizing high-dimensional data and gives an intuition of how the data is arranged ante

  • Like 1
Posted
Just now, Meowmeow said:

Yeah I used tSNE for Euclidian distance. I need an algorithm for probability distribution of the cluster. 

There are some divergence based clustering. Special cases which measure probability distribution. try kl-divergence 

or u  can simply use chi-squared 

Posted
1 minute ago, ring_master said:

Clustering will be based on similiarity measure... so distance is used as similarity measure . You don;t want to use distance to measure similarity? Is that you're trying to do?

Kind of, if I cluster based on Euclidean distance, I can only tell that the points in a single cluster are similar. But I cannot tell how different one cluster is from another cluster based on how close/far it is. So, I am trying to get that information. 

Posted
1 minute ago, kathanayaka said:

why did even tSNE come into picture

Its just for data exploration and visualizing high-dimensional data and gives an intuition of how the data is arranged ante

I am trying to visualize high dimensional data, but I want the distance between the clusters to indicate how similar/or not they are to each other. 

Posted

I might just give up this question, this is for extra credit and I think I dided everything else right. 

Posted
Just now, Meowmeow said:

I might just give up this question, this is for extra credit and I think I dided everything else right. 

Final ga nuv bussu anna mata..

Posted
5 minutes ago, Meowmeow said:

I am trying to visualize high dimensional data, but I want the distance between the clusters to indicate how similar/or not they are to each other. 

yes you need a clustering algorithm . hence K means or SVM give a shot

Posted
8 minutes ago, Meowmeow said:

Kind of, if I cluster based on Euclidean distance, I can only tell that the points in a single cluster are similar. But I cannot tell how different one cluster is from another cluster based on how close/far it is. So, I am trying to get that information. 

Ok I got it . 

Posted
3 minutes ago, kathanayaka said:

yes you need a clustering algorithm . hence K means or SVM give a shot

Arey k means clustering answer kadu ra, this is a traditional approach he thought in the class, I used this algorithm for all the earlier problems. 

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...