Harmonized System Code Classification Using Transfer Learning with Pre-Trained Weights
dc.contributor.author | Pain, Koustav | |
dc.contributor.copyright-release | Not Applicable | en_US |
dc.contributor.degree | Master of Computer Science | en_US |
dc.contributor.department | Faculty of Computer Science | en_US |
dc.contributor.ethics-approval | Not Applicable | en_US |
dc.contributor.external-examiner | n/a | en_US |
dc.contributor.graduate-coordinator | Dr. Evangelos Milios | en_US |
dc.contributor.manuscripts | Not Applicable | en_US |
dc.contributor.thesis-reader | Dr. Evangelos Milios | en_US |
dc.contributor.thesis-reader | Dr. Srinivas Sampalli | en_US |
dc.contributor.thesis-supervisor | Dr. Vlado Keselj | en_US |
dc.date.accessioned | 2021-08-13T12:53:48Z | |
dc.date.available | 2021-08-13T12:53:48Z | |
dc.date.defence | 2021-07-26 | |
dc.date.issued | 2021-08-13T12:53:48Z | |
dc.description.abstract | The Harmonized System (HS) was developed as a multipurpose international product nomenclature that describes the type of good that is shipped. It allows customs authorities to identify and clear every commodity that enters or crosses any international borders. HS classification is to identify the HS code of a commodity according to its description information in a trade manifest. Compared with general text classification the challenge of this task is that commodity description texts are often short, unstructured and extremely noisy. HS misclassification can lead to penalties, fines and delays upon import. We first propose novel approaches for extracting and filtering relevant commodity information from a trade document. Then our HS classification methodology utilizes pre-trained STS models via deep transfer learning using sentence-level transfer. We also introduce a new evaluation method to properly evaluate our approach based on real-world applications. Extensive experiments and model comparisons show the superiority of our approach. | en_US |
dc.identifier.uri | http://hdl.handle.net/10222/80672 | |
dc.language.iso | en | en_US |
dc.subject | Customs Clearance | en_US |
dc.subject | International Shipping | en_US |
dc.subject | Short and Noisy Texts with more than Six Thousand Classes | en_US |
dc.subject | Semantic Textual Similarity based Text Classification | en_US |
dc.subject | Natural Language Processing | en_US |
dc.subject | Deep Transfer Learning | en_US |
dc.title | Harmonized System Code Classification Using Transfer Learning with Pre-Trained Weights | en_US |
dc.type | Thesis | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- KoustavPain2021.pdf
- Size:
- 5.5 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full Thesis Document
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: