INDONESIAN-SUNDANESE MACHINE TRANSLATION USING RECURRENT NEURAL NETWORKS

Yustiana Fauziyah, Ridwan Ilyas, Fatan Kasyidi

Abstract


Translation is the process of converting text in one language into another language. Previous research approached translation with Phrase-based Statistical Machine Translation (PSMT). This study builds an Indonesian-to-Sundanese translator. The pipeline starts with text preprocessing and Word2Vec word embeddings, then applies Neural Machine Translation (NMT) with an Encoder-Decoder architecture built around a Recurrent Neural Network (RNN). In testing, GRU gave the best result at 99.17%. The model with Attention reached 99.94%. Among the optimizers compared, Adam performed best at 99.35%, and the best BLEU score was 92.63% with a brevity penalty of 0.929. The translator reproduces the training predictions from Indonesian to Sundanese when the input sentence matches the corpus, but its translations are less accurate when the input sentence differs from the corpus.
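The abstract reports a BLEU score together with a brevity penalty. As a minimal sketch of how those two quantities relate, the snippet below uses the standard BLEU definition, where the penalty is 1 when the candidate is longer than the reference and exp(1 - r/c) otherwise. The function names and the example lengths are illustrative assumptions; the abstract does not state which BLEU implementation or n-gram weights the authors used.

```python
import math


def brevity_penalty(candidate_len: int, reference_len: int) -> float:
    """Standard BLEU brevity penalty for candidate length c and reference length r."""
    if candidate_len <= 0:
        return 0.0  # an empty candidate gets no credit
    if candidate_len > reference_len:
        return 1.0  # longer candidates are not penalized
    return math.exp(1.0 - reference_len / candidate_len)


def bleu_from_parts(precisions, candidate_len, reference_len):
    """Combine modified n-gram precisions (uniform weights) with the brevity penalty.

    `precisions` is a hypothetical list such as [p1, p2, p3, p4].
    """
    if min(precisions) <= 0:
        return 0.0  # the geometric mean is zero if any precision is zero
    log_mean = sum(math.log(p) for p in precisions) / len(precisions)
    return brevity_penalty(candidate_len, reference_len) * math.exp(log_mean)


if __name__ == "__main__":
    # Assumed lengths: a candidate about 7% shorter than its reference.
    print(brevity_penalty(93, 100))
    print(bleu_from_parts([1.0, 1.0, 1.0, 1.0], 93, 100))
```

Under this definition, a penalty below 1 (such as the reported 0.929) indicates that the system output was somewhat shorter than the reference translations.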


DOI: https://doi.org/10.33365/jti.v16i2.1930

Copyright (c) 2022 Yustiana Fauziyah

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


JURNAL TEKNOINFO
Published by Universitas Teknokrat Indonesia
Organized by Prodi S1 Informatika FTIK Universitas Teknokrat Indonesia

W: http://ejurnal.teknokrat.ac.id/index.php/teknoinfo/index
E : teknoinfo@teknokrat.ac.id.
Jl. Zainal Abidin Pagaralam, No.9-11, Labuhan Ratu, Bandarlampung
