feature(notizen): add notes from l2 morning

2026-04-30 10:59:51 +02:00
parent 5a7f4dfe38
commit 96a9f2e550
1 changed files with 29 additions and 5 deletions
@@ -126,13 +126,37 @@ Bewertung:
        - Bsp. Welche Seiten die der Mensch (Gold Standard) als relevant klassifiziert hat, werden tatsächlich angezeigt?
            - Perfekt wenn all relevanten Seiten angezeigt wurden
            - Schlecht wenn keine relevanten Seiten gefunden wurden
 - Erweiterte Metrik: Confusion Matrix
 - Precision vs Recall
    - There is often a trafe-off between Precision and Recall
    - improving the algorithm towards one weakens the other
        - Will ich das Modell in richtung Precision verbessern, wird der Recall schlechter und umgekehrt
        - Entweder das eine oder andere kann optimiert werden
        - Bspw. Suchmaschine: Einfach alles anzeigen, dann gibts keine False Negatives weil das Gesuchte immer gefunden wird
            - Die Precision wird dabei aber sehr sehr schlecht, weil ganz viele False Positives dabei sind
            - 100% Recall 0% Precision
        - Oft muss ein Kompromiss getroffen werden zwischen Precision und Recall
            - Die Entscheidung was optimiert werden soll, muss vom Entwicklungsteam getroffen werden
        - Precision-oriented users
            - Web Surfers
        - Recall-oriented users
            - Professional searches, legal, etc
 - Dafür gibt es aber folgendes Hilfsmittel: **F-measure**
    - Das gewichtete, harmonische Mittel zwischen Precision und Recall
        - Formel: Skript Seite 7
        - F = 1/( alpha* 1/P + [1-alpha] * 1/R) = (beta^2 + 1)PR / (beta^2P+R) = beta^2 = 1 - alpha / alpha
    - Ist parametrisierbar
        - Beta < emphasize precision
        - Beta > emphasize recall
 ### Other metrics 
-
+- the generalization of our binary classifier result matrix (classification result vs. gold standard) is called a confusion matrix
-
+    - many different metrics can be derived from this 
-
+        - https.//en.wikipedia.org/wiki/Confusion_matrix
-
+    - other widely used metrics include ROC, K-S, gail/lift, ...
-
+- for specific ML problems and algorithms many additional metrics exists