KDnuggets.com (KD stands for Knowledge Discovery) is the leading source of information on Data Mining, Web Mining, Knowledge Discovery, and Decision Support Topics, including News, Software, Solutions, Companies, Jobs, Courses, Meetings, Publications, and more.
From the category archives:
Classification
[OT] Star galaxies separation via SVM/CVM classification - Part 2
This is a modification of the experiments in this post.
I rapidly built a new training set and this time I use only this training set for training the SVM/CVM. Than, I test the new trained classifier on all three dataset of the previous post.
The training set contain 500 points and has been built using stars and galaxies from another portion of sky.
New accuracy results (SVM)
Longo 01: 95,96 %
Longo 02: 98,08 %
Longo 03: 97,956 %
New accuracy results (CVM)
Longo 01: 96,31 %
Longo 02: 97,67 %
Longo 03: 97,138 %
Let us consider the Longo 02 tested with CVM. We have
Completeness for Stars: 98,4 %
Contamination for Stars: 4,7 %
Completeness for Galaxies: 95,4 %
Contamination for Galaxies: 1,5 %
{ 0 comments }
[OT] Star galaxies separation via SVM/CVM classification
We have used some astrophysics star/galaxies datasets for our clustering problems, because they have heavily overlapping clusters.
Here we present some results of an SVM classification performed on the same datasets. In fact, S/G separation is usually faced in a supervised way.
We have used a simple nonlinear SVM/CVM classifier with a linear kernel (K(x,y) = x’ * y).
For each dataset, we have used 5% of it as training set. The rest is the test set.
Datasets:
Longo 01, 2500 items, 2000 stars, 500 galaxies
Longo 02, 9816 items, 2935 stars, 6883 galaxies
Longo 03, 10940 items, 2978 stars, 7964 galaxies
Accuracy results:
Longo 01: 95%
Longo 02: 98,0746%
Longo 03: 97,925%
Accuracy results with CVM:
Longo 01: 94,98%
Longo 02: 97,5%
Longo 03: 95,2%
Probably, other kernels could lead to better results, but it is necessary to understand in which way tune the hyperparameters, such as the kernel width and the soft margin constant, etc.
{ 0 comments }