Semantic Paraphrase Generation Using Transformer Architectures: A Comparative Study of Pre-trained and Fine-Tuned Models

RAHUL Birwadkar

doi:10.5937/jcfs4-64420

PDF

Published

2026-05-06

Section

Članci

Abstract

Semantic paraphrase generation plays a crucial role in academic and technical writing by enabling authors to restate content while preserving its original meaning. Traditional paraphrasing approaches, such as rule-based rewriting and statistical methods, often struggle to maintain semantic consistency and linguistic fluency, especially for complex or longer text segments. Recent advances in transformer-based architectures have significantly improved text generation capabilities by leveraging contextual representations and self-attention mechanisms. This paper presents a comparative study of pre-trained and fine-tuned transformer models for semantic paraphrase generation. We evaluate encoder–decoder–based transformer architectures, with a primary focus on the BART model in both pre-trained and fine-tuned settings, alongside a large generative language model used for paraphrase generation. The fine-tuning process adapts pre-trained models to paraphrasing tasks using task-specific data, enabling improved control over semantic preservation and output consistency. The evaluation is conducted using both quantitative and qualitative analysis, including training and validation loss trends and comparative examination of generated paraphrases. Experimental results demonstrate that fine tuned transformer models produce paraphrases with higher semantic fidelity and structural coherence compared to their pre-trained counterparts, while large generative models offer fluent but less deterministic outputs. The findings highlight the importance of task-specific fine-tuning for controlled and semantically accurate paraphrase generation. This study contributes practical insights into the selection and adaptation of transformer architectures for paraphrasing applications, particularly in academic and research-oriented writing contexts.

Keywords

Array
Array
Array
Array
Array

DOI: 10.5937/jcfs4-64420

References

I (we), the author(s), hereby declare under full moral, financial and criminal liability that the manuscript submitted for publication to the Journal of Computer and Forensic Sciences

a) is the result of my (our) own original research and that I (we) hold the right to publish it;

b) does not infringe any copyright or other third-party proprietary rights;

c) complies with the Journal’s research and publishing ethics standards;

d) has not been published elsewhere, under this or any other title;

e) is not under consideration by another publication, under this or any other title.

I (we) also declare under full moral, financial and criminal liability:

f) that all conflicts of interest that may directly or potentially influence or impart bias on the work have been disclosed in the manuscript;

g) that if the article has been accepted for publishing I (we) will transfer all copyright ownership of the manuscript to the University of Criminal Investigation and Police Studies in Belgrade.

Signed by the Corresponding Author on behalf of the all other authors.

Downloads

Download data is not yet available.