Structural Measures
Structural measures analyze the topology and geometry of the dataset.
N1 - Fraction of Borderline Points
Fraction of points on the class boundary using MST.
Interpretation: - Higher values indicate more boundary complexity
N2 - Ratio of Intra/Extra Class Nearest Neighbor Distance
Compares distances within and between classes.
Interpretation: - Lower values indicate better separation
T1 - Fraction of Hyperspheres Covering Data
Measures how many hyperspheres are needed to cover the data.
Interpretation: - Higher values indicate more complex structure
DBC - Distance-Based Complexity
Measures complexity based on distance distributions.
LSC - Local Set Cardinality
Measures the average size of local neighborhoods.
Clust - Clustering Measure
Measures how well data forms clusters.
Interpretation: - Higher values indicate more distinct clusters
NSG - Number of Spanning Graphs
Counts the number of connected components.
ICSV - Inter-Class to Intra-Class Similarity Variance
Compares inter-class and intra-class variance.
ONB - Overlap of Neighborhoods Between Classes
Measures neighborhood overlap between classes.
Example: Analyze All Structural Measures
cm = ComplexityMeasures(X, y)
structural = cm.get_all_complexity_measures(measures='structural')
for measure, value in structural.items():
print(f"{measure}: {value:.4f}")