Implements a probabilistic measure for comparing two words. More...
#include <SimilarityMeasures.h>
Public Member Functions | |
WordPairComparison (int rank, const StringSimilarityMeasure *sm) | |
Constructor. | |
vector< double > | distrEstimate (int minLen, int maxLen, int nSamples=1000) const |
Estimates the distribution of the distances (similarities) between two randomly generated words. | |
double | comparePair (const Word &w1, const Word &w2, const vector< double > &measureDistr) const |
Compare the pair of words using probabilistic measure. | |
double | comparePair (const Word &w1, const Word &w2) const |
Compare the pair of words using similarity measure. | |
Private Member Functions | |
WordPairComparison (const WordPairComparison &) | |
WordPairComparison & | operator= (const WordPairComparison &) |
Private Attributes | |
const StringSimilarityMeasure * | pSSM |
int | theRank |
Implements a probabilistic measure for comparing two words.
Basic Idea: ...
Definition at line 37 of file SimilarityMeasures.h.
WordPairComparison::WordPairComparison | ( | int | rank, | |
const StringSimilarityMeasure * | sm | |||
) | [inline] |
Constructor.
rank | - the rank of a free group | |
sm | - pointer to the corresponding similarity measure. |
Definition at line 45 of file SimilarityMeasures.h.
WordPairComparison::WordPairComparison | ( | const WordPairComparison & | ) | [private] |
Compare the pair of words using similarity measure.
w1 | - the first word | |
w2 | - the second word. |
double WordPairComparison::comparePair | ( | const Word & | w1, | |
const Word & | w2, | |||
const vector< double > & | measureDistr | |||
) | const |
Compare the pair of words using probabilistic measure.
w1 | - the first word | |
w2 | - the second word. | |
measureDistr | - distribution of the distance between two random words |
w1
and w2
been generated independently vector<double> WordPairComparison::distrEstimate | ( | int | minLen, | |
int | maxLen, | |||
int | nSamples = 1000 | |||
) | const |
Estimates the distribution of the distances (similarities) between two randomly generated words.
minLen | - the minimal length of a randomly generated word | |
maxLen | - the maximal length of a randomly generated word | |
nSamples | - the number of pair samples to be generated (1000 is the default value). |
WordPairComparison& WordPairComparison::operator= | ( | const WordPairComparison & | ) | [private] |
const StringSimilarityMeasure* WordPairComparison::pSSM [private] |
Definition at line 76 of file SimilarityMeasures.h.
int WordPairComparison::theRank [private] |
Definition at line 77 of file SimilarityMeasures.h.