ColBERT 변환#2
Open
sigridjineth wants to merge 3 commits into
Open
Conversation
8e5e3bd to
5bc774d
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Position IDs를 attention_mask 기반으로 바꾼 것
→ L0부터 끝까지 수학적 정합성 확보(가장 큰 근본 원인 제거).
PyTorch‑정확 LayerNorm(TorchLayerNormTF1)
→ LN의 미세 오차 제거. embedding 레벨과 모든 블록 잔차경로의 수치 일치 보장.
TF1 그래프 + feed‑assign + 커스텀 Saver(변수 전수집)
→ 그래프에 대형 Const 노드가 남지 않고, 모든 변수가 SavedModel에 포함.
→ TF Serving/Java 호환성과 배포 안정성의 핵심.
Export 단계 Sanity Check(임베딩+LN L0 assert)
→ 잘못된 변환물 저장 자체를 차단하는 마지막 안전장치.
ColBERT head 완전 일치(가중치/바이어스/마스킹/CLS 제외)
→ 실제 검색 품질과 직결되는 헤드가 PT와 수치적으로 동일(MSE=0).