
Evaluation

Confusion Matrix

A confusion matrix shows how the submitted classifications are distributed over the true classes:

              Predicted label
               0   1   2
True label  0  a   b   c
            1  d   e   f
            2  g   h   i

0 corresponds to irrelevant sentences (I), 1 to relevant sentences (R) and 2 to correct answers (C). For instance, "b" is the number of "irrelevant" examples ("0") classified as "relevant" ("1"), and "e" is the number of correctly classified "relevant" examples.
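
As an illustration, the matrix can be computed with a few lines of Python. This is only a sketch, not the official evaluation code; the function name, the integer label encoding (0, 1, 2) and the example lists are assumptions made here for clarity.

    def confusion_matrix(true_labels, predicted_labels, n_classes=3):
        # matrix[t][p] counts examples whose true label is t and predicted label is p
        matrix = [[0] * n_classes for _ in range(n_classes)]
        for t, p in zip(true_labels, predicted_labels):
            matrix[t][p] += 1
        return matrix

    # Example: matrix[0][1] corresponds to "b" in the table above, matrix[1][1] to "e".
    cm = confusion_matrix([0, 0, 1, 2, 2], [0, 1, 1, 2, 0])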

In Competition 1, only the viewed sentences affect the result. In Competition 2, all sentences are taken into account, including the unseen ones, so participants should make their best guess about the relevance of the unseen sentences.

Accuracy

Classification results for both the validation and test set are ranked according to accuracy, the number of correct classifications divided by the total number of examples in the set, or (a+e+i) divided by the sum of all nine elements. Note that the exact classification accuracy can thus be calculated directly from the confusion matrix.
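
In terms of the confusion matrix sketched above, the computation is simply (again an illustrative sketch rather than the official scoring script):

    def accuracy(cm):
        # diagonal elements (a, e, i) divided by the sum of all elements
        correct = sum(cm[k][k] for k in range(len(cm)))
        total = sum(sum(row) for row in cm)
        return correct / total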

Balanced Error Rate

In addition to accuracy, the Balanced Error Rate (BER) is calculated for each submitted result. BER is the average over the classes of the proportion of wrong classifications in each class, or ((b+c)/(a+b+c) + (d+f)/(d+e+f) + (g+h)/(g+h+i))/3. BER is used to determine the winner in the unlikely case that two (or more) contestants obtain equal accuracy.
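
The same matrix yields BER as follows (again only an illustrative sketch, reusing the hypothetical confusion_matrix layout defined above):

    def balanced_error_rate(cm):
        # for each true class (row), the fraction of misclassified examples,
        # e.g. (b+c)/(a+b+c) for class 0; the three fractions are then averaged
        per_class_error = []
        for k, row in enumerate(cm):
            wrong = sum(row) - row[k]
            per_class_error.append(wrong / sum(row))
        return sum(per_class_error) / len(per_class_error)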
