<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Commenti a: Support Vector Clustering Code</title>
	<atom:link href="http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/feed/" rel="self" type="application/rss+xml" />
	<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/</link>
	<description>Diario di lavoro della tesi di Vincenzo Russo / Work-log of Vincenzo Russo’s Thesis</description>
	<lastBuildDate>Tue, 01 Mar 2011 08:32:01 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>Di: Rimjhim</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-2233</link>
		<dc:creator>Rimjhim</dc:creator>
		<pubDate>Tue, 01 Mar 2011 08:32:01 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-2233</guid>
		<description>Sir,
As you have mentioned in your website about the development of  a completely re-engineered version of the SVC  software, which will be based on LIBSVM Plus, can you please tell me, when you are going to upload the new version of SVC software. 
Also I am not able to get the proper clustering on my data using  current version of your software , as there is a difficulty in selecting the q/C values.
 I am also unable to find out how to modify the code ,if we wants to change the stopping criteria which is based on number of clusters.

regards,
rimjhim</description>
		<content:encoded><![CDATA[<p>Sir,<br />
As you have mentioned in your website about the development of  a completely re-engineered version of the SVC  software, which will be based on LIBSVM Plus, can you please tell me, when you are going to upload the new version of SVC software.<br />
Also I am not able to get the proper clustering on my data using  current version of your software , as there is a difficulty in selecting the q/C values.<br />
 I am also unable to find out how to modify the code ,if we wants to change the stopping criteria which is based on number of clusters.</p>
<p>regards,<br />
rimjhim</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Vincenzo Russo</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-67</link>
		<dc:creator>Vincenzo Russo</dc:creator>
		<pubDate>Wed, 18 Feb 2009 13:49:04 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-67</guid>
		<description>&lt;strong&gt;UPDATE 18th of Feb, 2008&lt;/strong&gt;: the official page of this software is now &lt;a href=&quot;http://neminis.org/software/support-vector-clustering/&quot; rel=&quot;nofollow&quot;&gt;located at my official website&lt;/a&gt;.

Please use that page instead of this one, even if I am leaving the comments here open.</description>
		<content:encoded><![CDATA[<p><strong>UPDATE 18th of Feb, 2008</strong>: the official page of this software is now <a href="http://neminis.org/software/support-vector-clustering/" rel="nofollow">located at my official website</a>.</p>
<p>Please use that page instead of this one, even if I am leaving the comments here open.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Vincenzo Russo</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-66</link>
		<dc:creator>Vincenzo Russo</dc:creator>
		<pubDate>Wed, 18 Feb 2009 13:44:39 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-66</guid>
		<description>Dear Mamatha,

 that&#039;s a well known problem and, unfortunately, since I have changed a lot of things in my life, I am still working on the new release of the code.

That means that you can actually use this code on Linux and Mac OS X systems only. Sorry.

Regarding the &quot;suffix tree clustering algorithm&quot; you&#039;ve mentioned, what exactly do you mean? I never developed  nothing related to such a name, I guess.

Regards,

    Vincenzo.


PS: the official page of this software is now this one:

   http://neminis.org/software/support-vector-clustering/</description>
		<content:encoded><![CDATA[<p>Dear Mamatha,</p>
<p> that&#8217;s a well known problem and, unfortunately, since I have changed a lot of things in my life, I am still working on the new release of the code.</p>
<p>That means that you can actually use this code on Linux and Mac OS X systems only. Sorry.</p>
<p>Regarding the &#8220;suffix tree clustering algorithm&#8221; you&#8217;ve mentioned, what exactly do you mean? I never developed  nothing related to such a name, I guess.</p>
<p>Regards,</p>
<p>    Vincenzo.</p>
<p>PS: the official page of this software is now this one:</p>
<p>   <a href="http://neminis.org/software/support-vector-clustering/" rel="nofollow">http://neminis.org/software/support-vector-clustering/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: mamatha</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-65</link>
		<dc:creator>mamatha</dc:creator>
		<pubDate>Wed, 18 Feb 2009 13:36:09 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-65</guid>
		<description>Respected sir,
                        I R.Mamatha stuying M.Tech final year in V.I.T ,Vellore,Tamilnadu,India doing my 9 months project.
                            I am unalbe to run the code on windows because of some headerfiles which i am not understanding how to include and also sir is it possible to give the code for &quot;suffix tree clustering algorithm&quot; which i want  to use for my project.</description>
		<content:encoded><![CDATA[<p>Respected sir,<br />
                        I R.Mamatha stuying M.Tech final year in V.I.T ,Vellore,Tamilnadu,India doing my 9 months project.<br />
                            I am unalbe to run the code on windows because of some headerfiles which i am not understanding how to include and also sir is it possible to give the code for &#8220;suffix tree clustering algorithm&#8221; which i want  to use for my project.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Domus Neminis &#8212; Announcing LIBSVM Plus</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-64</link>
		<dc:creator>Domus Neminis &#8212; Announcing LIBSVM Plus</dc:creator>
		<pubDate>Mon, 23 Jun 2008 12:19:25 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-64</guid>
		<description>[...] basic for the next generation of the Support Vector Clustering (SVC) library, which will replace the SVC software I developed for my master thesis and that will have a lot of new [...]</description>
		<content:encoded><![CDATA[<p>[...] basic for the next generation of the Support Vector Clustering (SVC) library, which will replace the SVC software I developed for my master thesis and that will have a lot of new [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Vincenzo Russo</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-63</link>
		<dc:creator>Vincenzo Russo</dc:creator>
		<pubDate>Tue, 29 Apr 2008 07:58:48 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-63</guid>
		<description>Dear Lawrence,

   Sorry for the late. I was not able to connect for a week because I was not at home.

To be short, this is what I would do:

1. Run K-means several times with different &#039;k&#039; values and choose the best instance. &quot;Best&quot; here means the instance that produce the best value for some validity index (C-index are supposed to  be a good choice for K-means).

2. Run SVC several times with different parameters settings (kernel, C, q, etc.) and choose the best instance according to a validity index (the best choice is the index specifically developed for SVC, I guess).

3. Compare the results of the &quot;best k-means instance&quot; and the &quot;best SVC instance&quot;.

That&#039;s all.

And no, I am sure that the validity index developed for the SVC does not fit the K-means because the index relies on specific characteristics of the SVC.

I hope I am of help.

Best,

   VR.</description>
		<content:encoded><![CDATA[<p>Dear Lawrence,</p>
<p>   Sorry for the late. I was not able to connect for a week because I was not at home.</p>
<p>To be short, this is what I would do:</p>
<p>1. Run K-means several times with different &#8216;k&#8217; values and choose the best instance. &#8220;Best&#8221; here means the instance that produce the best value for some validity index (C-index are supposed to  be a good choice for K-means).</p>
<p>2. Run SVC several times with different parameters settings (kernel, C, q, etc.) and choose the best instance according to a validity index (the best choice is the index specifically developed for SVC, I guess).</p>
<p>3. Compare the results of the &#8220;best k-means instance&#8221; and the &#8220;best SVC instance&#8221;.</p>
<p>That&#8217;s all.</p>
<p>And no, I am sure that the validity index developed for the SVC does not fit the K-means because the index relies on specific characteristics of the SVC.</p>
<p>I hope I am of help.</p>
<p>Best,</p>
<p>   VR.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Lawrence</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-62</link>
		<dc:creator>Lawrence</dc:creator>
		<pubDate>Sun, 20 Apr 2008 15:59:01 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-62</guid>
		<description>Dear Vincenzo Russo，

   I have already have some idea on the evaluation of clustering results now. As you mentioned, from a review of the literature, generally there are three approaches, internal criteria, external criteria, and relative criteria, have been used for the quantitative evaluation of clustering results in many of the research studies.

   Internal criteria are the only means to evaluate the clustering quality of a completely new domain.

   External criteria imply clustering evaluation by means of external pre-specified structure information of a dataset. Generally, several real-life or synthetic datasets with prior class information such as Iris data, Wisconsin’s breast cancer database, and Spam database, are used as evaluation benchmark for clustering results.

   Relative criteria (also known as validity index) evaluate clustering results with different input parameter settings of a same clustering algorithm. A number of validity indices have been developed and proposed in literature.

  If i want to compare the performance of K-means and SVC in a completely new domain without prior class information. There are two methods as follows:

1.In the process of running K-means and SVC, validity index is embeded respectively for parameter selection. When the optimal clusering resluts are found respectively, the clstering resluts are evaluated by internal criteria.

2.There is a validity index specific for SVC which is quite useful for parameter selection as you mentioned (I don&#039;t know if it can used for K-means). If different validity indices are used in K-means and SVC, I think they can&#039;t be compared. If the same validity index is used in both clustering algorithms, regardless of it is not appropriate for SVC, can the index value work as a crtieria of the performance of these two clustering algorithms?

Maybe I didn&#039;t well express my question. I hope you can understand. I am a little confused here.


Many thanks and Best regards

Lawrrence</description>
		<content:encoded><![CDATA[<p>Dear Vincenzo Russo，</p>
<p>   I have already have some idea on the evaluation of clustering results now. As you mentioned, from a review of the literature, generally there are three approaches, internal criteria, external criteria, and relative criteria, have been used for the quantitative evaluation of clustering results in many of the research studies.</p>
<p>   Internal criteria are the only means to evaluate the clustering quality of a completely new domain.</p>
<p>   External criteria imply clustering evaluation by means of external pre-specified structure information of a dataset. Generally, several real-life or synthetic datasets with prior class information such as Iris data, Wisconsin’s breast cancer database, and Spam database, are used as evaluation benchmark for clustering results.</p>
<p>   Relative criteria (also known as validity index) evaluate clustering results with different input parameter settings of a same clustering algorithm. A number of validity indices have been developed and proposed in literature.</p>
<p>  If i want to compare the performance of K-means and SVC in a completely new domain without prior class information. There are two methods as follows:</p>
<p>1.In the process of running K-means and SVC, validity index is embeded respectively for parameter selection. When the optimal clusering resluts are found respectively, the clstering resluts are evaluated by internal criteria.</p>
<p>2.There is a validity index specific for SVC which is quite useful for parameter selection as you mentioned (I don&#8217;t know if it can used for K-means). If different validity indices are used in K-means and SVC, I think they can&#8217;t be compared. If the same validity index is used in both clustering algorithms, regardless of it is not appropriate for SVC, can the index value work as a crtieria of the performance of these two clustering algorithms?</p>
<p>Maybe I didn&#8217;t well express my question. I hope you can understand. I am a little confused here.</p>
<p>Many thanks and Best regards</p>
<p>Lawrrence</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Lawrence</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-61</link>
		<dc:creator>Lawrence</dc:creator>
		<pubDate>Thu, 17 Apr 2008 13:25:50 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-61</guid>
		<description>Dear Vincenzo Russo，

  Thank you very very much. It seems clear to me. I&#039;d like to read relevant parts of your thesis and the paper suggested by you first. You really give me a great inspiration on how to evaluate the quality of clustering results when datasets are unlabeled. It&#039;s great appreciated for your help.

   Best Regards

   Lawrence</description>
		<content:encoded><![CDATA[<p>Dear Vincenzo Russo，</p>
<p>  Thank you very very much. It seems clear to me. I&#8217;d like to read relevant parts of your thesis and the paper suggested by you first. You really give me a great inspiration on how to evaluate the quality of clustering results when datasets are unlabeled. It&#8217;s great appreciated for your help.</p>
<p>   Best Regards</p>
<p>   Lawrence</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Vincenzo Russo</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-60</link>
		<dc:creator>Vincenzo Russo</dc:creator>
		<pubDate>Thu, 17 Apr 2008 08:14:59 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-60</guid>
		<description>Dear Lawrence,

   welcome back.

  Since your datasets are unlabeled, the first thing you need are some criteria to evaluate the quality of clustering results. The most effective way to evaluate clustering results when data are unlabeled, is a relative criterion (aka validity index).

In the section 3.3.3 of my thesis I present some of classical validity indices and in

&lt;a href=&quot;http://portal.acm.org/citation.cfm?id=1294369.1294523&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=15151515&amp;CFTOKEN=6184618&quot; rel=&quot;nofollow&quot;&gt;J. Wang and J. Chiang, &quot;A cluster validity measure with a hybrid parameter search method for the support vector clustering algorithm,&quot; Pattern Recognition, vol. 41, iss. 2, pp. 506-520, 2008.&lt;/a&gt;

you can found a validity index specific for SVC. I used it to develop a new stopping criterion for SVC (not available in the software you are using).

So, you have to run the SVC and K-means several times, each with a different parameter settings and then evaluate the results. The better index value, the better the clustering. Since you probably don&#039;t know the number of clusters and K-means need it, a classical way is to run K-means with different number of clusters in input and then choose the instance that yields the best validity index value. As far as the SVC is concerned, you can try different combinations of q/C/kernel and choose the instance that yields the best value of the validity index.

I hope I was clear.

Best,
  VR.</description>
		<content:encoded><![CDATA[<p>Dear Lawrence,</p>
<p>   welcome back.</p>
<p>  Since your datasets are unlabeled, the first thing you need are some criteria to evaluate the quality of clustering results. The most effective way to evaluate clustering results when data are unlabeled, is a relative criterion (aka validity index).</p>
<p>In the section 3.3.3 of my thesis I present some of classical validity indices and in</p>
<p><a href="http://portal.acm.org/citation.cfm?id=1294369.1294523&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=15151515&amp;CFTOKEN=6184618" rel="nofollow">J. Wang and J. Chiang, &#8220;A cluster validity measure with a hybrid parameter search method for the support vector clustering algorithm,&#8221; Pattern Recognition, vol. 41, iss. 2, pp. 506-520, 2008.</a></p>
<p>you can found a validity index specific for SVC. I used it to develop a new stopping criterion for SVC (not available in the software you are using).</p>
<p>So, you have to run the SVC and K-means several times, each with a different parameter settings and then evaluate the results. The better index value, the better the clustering. Since you probably don&#8217;t know the number of clusters and K-means need it, a classical way is to run K-means with different number of clusters in input and then choose the instance that yields the best validity index value. As far as the SVC is concerned, you can try different combinations of q/C/kernel and choose the instance that yields the best value of the validity index.</p>
<p>I hope I was clear.</p>
<p>Best,<br />
  VR.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Di: Lawrence</title>
		<link>http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/comment-page-1/#comment-59</link>
		<dc:creator>Lawrence</dc:creator>
		<pubDate>Thu, 17 Apr 2008 07:36:46 +0000</pubDate>
		<guid isPermaLink="false">http://thesis.neminis.org/2007/12/03/support-vector-clustering-code/#comment-59</guid>
		<description>Dear Vincenzo Russo，

   I notice that you compared the performance of different clustering methods using real-life benchmarks such as Iris data, Wisconsin’s breast cancer database, and Wine Recognition Database in your thesis. Camastra (2005) also compared the performance of the current clustering methods including K-means, Neural Gas, Self-Organizing Map (SOM), Spectral clustering algorithm and SVC on three kinds of real-life benchmarks. However, there already are class information for these data sets, that is, we have already know the class each observation belongs. It&#039;s quite OK and necessary that these datasets are used to compare the results of clustering methods. If I want to compare the performance of SVC and K-means on a dataset without class information ahead, do you have some suggestions?


   Many thanks and Best Regards

   Lawrence</description>
		<content:encoded><![CDATA[<p>Dear Vincenzo Russo，</p>
<p>   I notice that you compared the performance of different clustering methods using real-life benchmarks such as Iris data, Wisconsin’s breast cancer database, and Wine Recognition Database in your thesis. Camastra (2005) also compared the performance of the current clustering methods including K-means, Neural Gas, Self-Organizing Map (SOM), Spectral clustering algorithm and SVC on three kinds of real-life benchmarks. However, there already are class information for these data sets, that is, we have already know the class each observation belongs. It&#8217;s quite OK and necessary that these datasets are used to compare the results of clustering methods. If I want to compare the performance of SVC and K-means on a dataset without class information ahead, do you have some suggestions?</p>
<p>   Many thanks and Best Regards</p>
<p>   Lawrence</p>
]]></content:encoded>
	</item>
</channel>
</rss>

