Attention Mechanism and Context Modeling System for Text Mining Machine Translation

Zhang, Yuwei; Huang, Junming; Liu, Sitong; Chen, Zexi; Li, Zizheng

Computer Science > Computation and Language

arXiv:2408.04216 (cs)

[Submitted on 8 Aug 2024 (v1), last revised 18 Jan 2025 (this version, v3)]

Title:Attention Mechanism and Context Modeling System for Text Mining Machine Translation

Authors:Yuwei Zhang, Junming Huang, Sitong Liu, Zexi Chen, Zizheng Li

View PDF

Abstract:This paper advances a novel architectural schema anchored upon the Transformer paradigm and innovatively amalgamates the K-means categorization algorithm to augment the contextual apprehension capabilities of the schema. The transformer model performs well in machine translation tasks due to its parallel computing power and multi-head attention mechanism. However, it may encounter contextual ambiguity or ignore local features when dealing with highly complex language structures. To circumvent this constraint, this exposition incorporates the K-Means algorithm, which is used to stratify the lexis and idioms of the input textual matter, thereby facilitating superior identification and preservation of the local structure and contextual intelligence of the language. The advantage of this combination is that K-Means can automatically discover the topic or concept regions in the text, which may be directly related to translation quality. Consequently, the schema contrived herein enlists K-Means as a preparatory phase antecedent to the Transformer and recalibrates the multi-head attention weights to assist in the discrimination of lexis and idioms bearing analogous semantics or functionalities. This ensures the schema accords heightened regard to the contextual intelligence embodied by these clusters during the training phase, rather than merely focusing on locational intelligence.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.04216 [cs.CL]
	(or arXiv:2408.04216v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.04216

Submission history

From: Shi Bo [view email]
[v1] Thu, 8 Aug 2024 04:52:10 UTC (631 KB)
[v2] Sun, 29 Dec 2024 19:00:55 UTC (340 KB)
[v3] Sat, 18 Jan 2025 00:29:19 UTC (631 KB)

Computer Science > Computation and Language

Title:Attention Mechanism and Context Modeling System for Text Mining Machine Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Attention Mechanism and Context Modeling System for Text Mining Machine Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators