Dacon Monthly Court Judgment Prediction AI Challenge

Algorithm | NLP | Classification | Accuracy

 

private 9th Deberta + data augmentation

공동작성자
2023.07.08 14:59 2,174 Views language

microsoft에서 나온 deberta 모델을 사용하였습니다. 
훈련데이터가 워낙 적다고 생각되서 영어 --> 독일어 --> 영어를 이용한 back translation을 통해 데이터량을 늘렸습니다.
first party 와 second party의 이름과 winner를 바꾼 데이터추가시켜 target imbalance를 최대한 없애봤습니다. 
firstparty 와 second party를 확실하게 구분짓기위해서 모든 본래의 first party와 second party의 이름 대신 wizard 와 sorcerer라는 
임의의 단어로 교체했습니다. 


Code