| Term | Antony & Cleo | Julius Caesar | The Tempest | Hamlet | Othello | Macbeth |
|---|---|---|---|---|---|---|
| Antony | 1 | 1 | 0 | 0 | 0 | 1 |
| Brutus | 1 | 1 | 0 | 1 | 0 | 0 |
| Caesar | 1 | 1 | 0 | 1 | 1 | 1 |
| Calpurnia | 0 | 1 | 0 | 0 | 0 | 0 |
| Cleopatra | 1 | 0 | 0 | 0 | 0 | 0 |
| mercy | 1 | 0 | 1 | 1 | 1 | 1 |
| worser | 1 | 0 | 1 | 1 | 1 | 0 |
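
A binary incidence matrix like the one above can answer Boolean queries by combining term rows position by position. The sketch below is only illustrative: the function name `and_not_query` and the example query are assumptions, and the rows are copied from the table.

```python
# Rows copied from the incidence matrix above; column order follows its header.
PLAYS = ["Antony & Cleo", "Julius Caesar", "The Tempest", "Hamlet", "Othello", "Macbeth"]
INCIDENCE = {
    "Antony":    [1, 1, 0, 0, 0, 1],
    "Brutus":    [1, 1, 0, 1, 0, 0],
    "Caesar":    [1, 1, 0, 1, 1, 1],
    "Calpurnia": [0, 1, 0, 0, 0, 0],
    "Cleopatra": [1, 0, 0, 0, 0, 0],
    "mercy":     [1, 0, 1, 1, 1, 1],
    "worser":    [1, 0, 1, 1, 1, 0],
}


def and_not_query(must_have, must_not_have):
    """Return plays that contain every term in must_have and none in must_not_have."""
    hits = []
    for i, play in enumerate(PLAYS):
        has_all = all(INCIDENCE[term][i] == 1 for term in must_have)
        has_none = all(INCIDENCE[term][i] == 0 for term in must_not_have)
        if has_all and has_none:
            hits.append(play)
    return hits


# Example query (an assumption): Brutus AND Caesar AND NOT Calpurnia
result = and_not_query(["Brutus", "Caesar"], ["Calpurnia"])
print(result)
```
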

| Term | Antony & Cleo | Julius Caesar | The Tempest | Hamlet | Othello | Macbeth |
|---|---|---|---|---|---|---|
| Antony | 157 | 73 | 0 | 0 | 0 | 0 |
| Brutus | 4 | 157 | 0 | 1 | 0 | 0 |
| Caesar | 232 | 227 | 0 | 2 | 1 | 1 |
| Calpurnia | 0 | 10 | 0 | 0 | 0 | 0 |
| Cleopatra | 57 | 0 | 0 | 0 | 0 | 0 |
| mercy | 2 | 0 | 3 | 5 | 5 | 1 |
| worser | 2 | 0 | 1 | 1 | 1 | 0 |

| Word | Collection freq (cf) | Document freq (df) |
|---|---|---|
| insurance | 10440 | 3997 |
| try | 10422 | 8760 |
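
The two words above have nearly the same collection frequency but very different document frequencies. A minimal sketch of how the two statistics differ, using a hypothetical three-document toy collection (not the corpus behind the table) and an assumed helper name `collection_and_document_freq`:

```python
from collections import Counter

# Hypothetical toy collection, chosen so two terms share cf but differ in df.
DOCS = [
    "insurance insurance insurance claim",
    "try to try again",
    "try this",
]


def collection_and_document_freq(docs):
    """Count total occurrences (cf) and number of containing documents (df) per term."""
    cf, df = Counter(), Counter()
    for text in docs:
        tokens = text.split()
        cf.update(tokens)       # every occurrence counts toward cf
        df.update(set(tokens))  # each document counts at most once toward df
    return cf, df


cf, df = collection_and_document_freq(DOCS)
print(cf["insurance"], df["insurance"])
print(cf["try"], df["try"])
```
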

| Term | $\mathrm{df}_t$ | $\mathrm{idf}_t$ |
|---|---|---|
| calpurnia | 1 | 6 |
| animal | 100 | 4 |
| sunday | 1,000 | 3 |
| fly | 10,000 | 2 |
| under | 100,000 | 1 |
| the | 1,000,000 | 0 |
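
The idf values above are consistent with a base-10 logarithm of N/df for a collection of one million documents. The sketch below recomputes them under that assumption; the collection size `N` is inferred from the table (the term with df = 1,000,000 has idf 0), not stated in it.

```python
import math

# Assumed collection size, inferred from the table: idf is 0 when df = 1,000,000.
N = 1_000_000


def idf(df, n_docs=N):
    """Inverse document frequency with a base-10 logarithm."""
    return math.log10(n_docs / df)


# Document frequencies copied from the table above.
DF = {
    "calpurnia": 1,
    "animal": 100,
    "sunday": 1_000,
    "fly": 10_000,
    "under": 100_000,
    "the": 1_000_000,
}

IDF = {term: round(idf(df), 2) for term, df in DF.items()}
for term, value in IDF.items():
    print(term, value)
```
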

| Term | Antony & Cleo | Julius Caesar | The Tempest | Hamlet | Othello | Macbeth |
|---|---|---|---|---|---|---|
| Antony | 5.25 | 3.18 | 0 | 0 | 0 | 0.35 |
| Brutus | 1.21 | 6.1 | 0 | 1 | 0 | 0 |
| Caesar | 8.59 | 2.54 | 0 | 1.51 | 0.25 | 0 |
| Calpurnia | 0 | 1.54 | 0 | 0 | 0 | 0 |
| Cleopatra | 2.85 | 0 | 0 | 0 | 0 | 0 |
| mercy | 1.51 | 0 | 1.9 | 0.12 | 5.25 | 0.88 |
| worser | 1.37 | 0 | 0.11 | 4.15 | 0.25 | 1.95 |

| Term | Sense and Sensibility (SaS) | Pride and Prejudice (PaP) | Wuthering Heights (WH) |
|---|---|---|---|
| affection | 115 | 58 | 20 |
| jealous | 10 | 7 | 11 |
| gossip | 2 | 0 | 6 |
| wuthering | 0 | 0 | 38 |

| Term | SaS | PaP | WH |
|---|---|---|---|
| affection | 3.06 | 2.76 | 2.30 |
| jealous | 2.00 | 1.85 | 2.04 |
| gossip | 1.30 | 0 | 1.78 |
| wuthering | 0 | 0 | 2.58 |

| Term | SaS | PaP | WH |
|---|---|---|---|
| affection | 0.789 | 0.832 | 0.524 |
| jealous | 0.515 | 0.555 | 0.465 |
| gossip | 0.335 | 0 | 0.405 |
| wuthering | 0 | 0 | 0.588 |
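
The three novel tables above appear to follow one pipeline: raw counts, then log-weighted term frequency 1 + log10(tf), then division by the vector's Euclidean length. The sketch below recomputes those steps from the raw counts. The helper names `log_tf`, `normalize` and `cosine` are assumptions, and the final dot product of two unit vectors is the usual cosine similarity rather than a value taken from the tables.

```python
import math

# Raw term counts copied from the first novel table
# (SaS = Sense and Sensibility, PaP = Pride and Prejudice, WH = Wuthering Heights).
COUNTS = {
    "SaS": {"affection": 115, "jealous": 10, "gossip": 2, "wuthering": 0},
    "PaP": {"affection": 58, "jealous": 7, "gossip": 0, "wuthering": 0},
    "WH":  {"affection": 20, "jealous": 11, "gossip": 6, "wuthering": 38},
}


def log_tf(count):
    """Log-weighted term frequency: 1 + log10(tf) for tf > 0, else 0."""
    return 1 + math.log10(count) if count > 0 else 0.0


def normalize(weights):
    """Scale a weight vector to unit Euclidean length."""
    length = math.sqrt(sum(w * w for w in weights.values()))
    return {term: w / length for term, w in weights.items()}


def cosine(u, v):
    """Dot product of two vectors that are already length-normalized."""
    return sum(u[term] * v[term] for term in u)


LOG_TF = {doc: {t: log_tf(c) for t, c in counts.items()} for doc, counts in COUNTS.items()}
VECTORS = {doc: normalize(weights) for doc, weights in LOG_TF.items()}

for doc, vec in VECTORS.items():
    print(doc, {term: round(w, 3) for term, w in vec.items()})
print("cos(SaS, PaP) =", round(cosine(VECTORS["SaS"], VECTORS["PaP"]), 2))
```
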

| Term frequency (tf) | Document frequency (idf) | Normalization |
|---|---|---|
| n (natural): $\mathrm{tf}_{t,d}$ | n (no): 1 | n (none): 1 |
| l (logarithm): $1 + \log(\mathrm{tf}_{t,d})$ | t (idf): $\log(N/\mathrm{df}_t)$ | c (cosine): $1/\sqrt{\sum w^2}$ |
| a (augmented): 0.5 + ... | p (prob idf): $\log((N - \mathrm{df}_t)/\mathrm{df}_t)$ | u (pivoted unique): $1/u$ |
| b (boolean): 1 if $\mathrm{tf}_{t,d} > 0$, else 0 | | b (byte size): $1/\mathrm{CharLength}^{\alpha}$ |
| L (log ave): complex log | | |

| Term | Query (ltc) tf-wt | Query idf | Query normalized | Doc (lnc) tf-raw | Doc tf-wt | Doc normalized | Product |
|---|---|---|---|---|---|---|---|
| auto | 0 | 2.3 | 0 | 1 | 1 | 0.52 | 0 |
| best | 1 | 1.3 | 0.34 | 0 | 0 | 0 | 0 |
| car | 1 | 2.0 | 0.52 | 1 | 1 | 0.52 | 0.27 |
| insurance | 1 | 3.0 | 0.78 | 2 | 1.3 | 0.68 | 0.53 |
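
The lnc.ltc table above can be recomputed from three of its columns: the query's log-weighted tf, the idf, and the document's raw tf. The sketch below is a minimal reading of that scheme, assuming base-10 logarithms and cosine normalization on both sides. The names `ROWS`, `log_tf` and `cosine_normalize` are hypothetical.

```python
import math

# term: (query tf-wt, idf, document raw tf), copied from the table above.
ROWS = {
    "auto":      (0, 2.3, 1),
    "best":      (1, 1.3, 0),
    "car":       (1, 2.0, 1),
    "insurance": (1, 3.0, 2),
}


def log_tf(tf):
    """Log-weighted term frequency: 1 + log10(tf) for tf > 0, else 0."""
    return 1 + math.log10(tf) if tf > 0 else 0.0


def cosine_normalize(weights):
    """Scale a weight vector to unit Euclidean length."""
    length = math.sqrt(sum(w * w for w in weights.values()))
    return {term: w / length for term, w in weights.items()}


# Query side (ltc): log tf (already given as tf-wt) times idf, then cosine normalization.
query = cosine_normalize({t: q_wt * idf for t, (q_wt, idf, _) in ROWS.items()})
# Document side (lnc): log tf, no idf, then cosine normalization.
doc = cosine_normalize({t: log_tf(d_tf) for t, (_, _, d_tf) in ROWS.items()})

products = {t: query[t] * doc[t] for t in ROWS}
score = sum(products.values())

for term in ROWS:
    print(term, round(query[term], 2), round(doc[term], 2), round(products[term], 2))
print("score =", round(score, 2))
```

Summing the Product column gives the query-document score, which comes to roughly 0.8 for these values.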