KLASIFIKASI SENTIMEN VAKSIN COVID-19 MENGGUNAKAN K-NEAREST NEIGHBOR BERDASARKAN WORD EMBEDDINGS FASTTEXT PADA TWITTER

نویسندگان

چکیده

Pada akhir 2019 muncul penyakit semacam flu yang menginfeksi paru-paru di kota Wuhan. Diduga tersebut diduga berasal dari kelelawar. WHO memberi nama ini dengan Covid-19 dan virus tersebar ke seluruh dunia sehingga menyebabkan pandemi. Pemerintah mengambil indakan vaksinasi untuk mengatasi ini, namun mendapat respon pro kontra masyarakat. Ada banyak penelitian membahas sentimen masyarakat terhadap salah satunya adalah klasifikasi sentimen. Penelitian vaksin covid-19 menggunakan algoritma K-Nearest Neighbor Fasttext pada twitter. Data diperoleh cara crawling bahasa pemograman pyton Twitter API. Pelabelan data dilakukan teknik crowdsourcing majority voting. digunakan setelah proses penyeimbangan 6000 training, 778 development 400 test. Hasil pengujian berbagai eksperimen feature engineering mendapatkan hasil terbaik nilai akurasi 69% f1-score 60%. merupakan dibanding sebelumnya dataset sama.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Second-Order Word Embeddings from Nearest Neighbor Topological Features

We introduce second-order vector representations of words, induced from nearest neighborhood topological features in pre-trained contextual word embeddings. We then analyze the effects of using second-order embeddings as input features in two deep natural language processing models, for named entity recognition and recognizing textual entailment, as well as a linear model for paraphrase recogni...

متن کامل

Fast Nearest Neighbor Preserving Embeddings

We show an analog to the Fast Johnson-Lindenstrauss Transform for Nearest Neighbor Preserving Embeddings in `2. These are sparse, randomized embeddings that preserve the (approximate) nearest neighbors. The dimensionality of the embedding space is bounded not by the size of the embedded set n, but by its doubling dimension λ. For most large real-world datasets this will mean a considerably lowe...

متن کامل

Klasifikasi Data Cardiotocography Dengan Integrasi Metode Neural Network Dan Particle Swarm Optimization

Backpropagation (BP) adalah sebuah metode yang digunakan dalam training Neural Network (NN) untuk menentukan parameter bobot yang sesuai. Proses penentuan parameter bobot dengan menggunakan metode backpropagation sangat dipengaruhi oleh pemilihan nilai learning rate (LR)-nya. Penggunaan nilai learning rate yang kurang optimal berdampak pada waktu komputasi yang lama atau akurasi klasifikasi yan...

متن کامل

Drought Monitoring and Prediction using K-Nearest Neighbor Algorithm

Drought is a climate phenomenon which might occur in any climate condition and all regions on the earth. Effective drought management depends on the application of appropriate drought indices. Drought indices are variables which are used to detect and characterize drought conditions. In this study, it was tried to predict drought occurrence, based on the standard precipitation index (SPI), usin...

متن کامل

Fast Approximate Nearest-Neighbor Search with k-Nearest Neighbor Graph

We introduce a new nearest neighbor search algorithm. The algorithm builds a nearest neighbor graph in an offline phase and when queried with a new point, performs hill-climbing starting from a randomly sampled node of the graph. We provide theoretical guarantees for the accuracy and the computational complexity and empirically show the effectiveness of this algorithm.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Zonasi

سال: 2023

ISSN: ['2656-7407', '2656-7393']

DOI: https://doi.org/10.31849/zn.v5i2.12548