Skip to content

Commit 3cd9dc2

Browse files
committed
update posts
1 parent ec3f7a0 commit 3cd9dc2

File tree

2 files changed

+5
-3
lines changed

2 files changed

+5
-3
lines changed

posts/290325.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: 'ocean view library'
2+
title: 'hmart daly city'
33
tags: 'journal'
44
date: 'Mar 29, 2025'
55
---

posts/310325.md

+4-2
Original file line numberDiff line numberDiff line change
@@ -25,9 +25,9 @@ why? this helps us explicitly capture interactions and add non-linearity
2525

2626
you can cross catXcat, numXnum, catXnum, or even higher order cross (three or more features)
2727

28-
no we know Deep neural networks learns these interactions implicitly. the multiple layers and ReLU can approximate complet functions. however they might be inefficient to learn specific simple multiplicative crosses, requiring many neurons or layers to learn.
28+
we know Deep neural networks learns these interactions implicitly. the multiple layers and ReLU can approximate complet functions. however they might be inefficient to learn specific simple multiplicative crosses, requiring many neurons or layers to learn.
2929

30-
introducing: deep & cross network. the cross network is part of DCN, specifically designed to create tehse feature crosses explicitly.
30+
introducing: [deep & cross network](https://paperswithcode.com/method/dcn-v2). the cross network is part of DCN, specifically designed to create these feature crosses explicitly.
3131

3232
the formula: $X_{l+1} = x_0 * {W_l * x_l + b_l} + x_l$ is mathematically structured to compute interactions between the original input features (x_0) and the representations learned so far (x_l).
3333

@@ -53,6 +53,8 @@ and DCN's have been [evolving](https://mlfrontiers.substack.com/p/the-rise-of-de
5353
- Self-Mask operation to filter noise and reduce the number of parameters in the Cross Network by half
5454
- fusion layer, multi-loss trade-off and calculation -> Tri-BCE, to provide appropriate supervision signals
5555

56+
here's a [github](https://github.com/shenweichen/DeepCTR-Torch?tab=readme-ov-file) repo implement them in pytorch
57+
5658
but when are they used in the multi-stage ranking system? they're often used in the l2 or final (reranking) stage.
5759

5860
the architecture is usually:

0 commit comments

Comments
 (0)