You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: posts/310325.md
+4-2
Original file line number
Diff line number
Diff line change
@@ -25,9 +25,9 @@ why? this helps us explicitly capture interactions and add non-linearity
25
25
26
26
you can cross catXcat, numXnum, catXnum, or even higher order cross (three or more features)
27
27
28
-
no we know Deep neural networks learns these interactions implicitly. the multiple layers and ReLU can approximate complet functions. however they might be inefficient to learn specific simple multiplicative crosses, requiring many neurons or layers to learn.
28
+
we know Deep neural networks learns these interactions implicitly. the multiple layers and ReLU can approximate complet functions. however they might be inefficient to learn specific simple multiplicative crosses, requiring many neurons or layers to learn.
29
29
30
-
introducing: deep & cross network. the cross network is part of DCN, specifically designed to create tehse feature crosses explicitly.
30
+
introducing: [deep & cross network](https://paperswithcode.com/method/dcn-v2). the cross network is part of DCN, specifically designed to create these feature crosses explicitly.
31
31
32
32
the formula: $X_{l+1} = x_0 * {W_l * x_l + b_l} + x_l$ is mathematically structured to compute interactions between the original input features (x_0) and the representations learned so far (x_l).
33
33
@@ -53,6 +53,8 @@ and DCN's have been [evolving](https://mlfrontiers.substack.com/p/the-rise-of-de
53
53
- Self-Mask operation to filter noise and reduce the number of parameters in the Cross Network by half
54
54
- fusion layer, multi-loss trade-off and calculation -> Tri-BCE, to provide appropriate supervision signals
55
55
56
+
here's a [github](https://github.com/shenweichen/DeepCTR-Torch?tab=readme-ov-file) repo implement them in pytorch
57
+
56
58
but when are they used in the multi-stage ranking system? they're often used in the l2 or final (reranking) stage.
0 commit comments