captioning order

نتایج جستجو برای: captioning order

تعداد نتایج: 908879 فیلتر نتایج به سال:

Subtitling for d/Deaf and Hard-of-Hearing Children: Current Practices and New Possibilities to Enhance Language Development

2017

Ana Tamayo Frederic Chaume

In order to understand and fully comprehend a subtitle, two parameters within the linguistic code of audiovisual texts are key in the processing of the subtitle itself, namely, vocabulary and syntax. Through a descriptive and experimental study, the present article explores the transfer of the linguistic code of audiovisual texts in subtitling for deaf and hard-of-hearing children in three Span...

متن کامل

Boost image captioning with knowledge reasoning

Journal: :Machine Learning 2020

متن کامل

Topic scene graphs for image captioning

Journal: :Iet Computer Vision 2022

When describing an image, people can rapidly extract the topic from image and find main object, generating sentences that match idea of image. However, most scene graph generation methods do not emphasise importance Consequently, captions generated by graph-based captioning models cannot reflect in then expressing central In this paper, we propose a method for based on graphs (TSG). Firstly, st...

متن کامل

Semantic Grouping Network for Video Captioning

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2021

This paper considers a video caption generating network referred to as Semantic Grouping Network (SGN) that attempts (1) group frames with discriminating word phrases of partially decoded and then (2) decode those semantically aligned groups in predicting the next word. As consecutive are not likely provide unique information, prior methods have focused on discarding or merging repetitive infor...

متن کامل

Supplementary Material: Multi-Task Video Captioning with Video and Entailment Generation

2017

Ramakanth Pasunuru Mohit Bansal

1.1.1 Video Captioning Datasets YouTube2Text or MSVD The Microsoft Research Video Description Corpus (MSVD) or YouTube2Text (Chen and Dolan, 2011) is used for our primary video captioning experiments. It has 1970 YouTube videos in the wild with many diverse captions in multiple languages for each video. Caption annotations to these videos are collected using Amazon Mechanical Turk (AMT). All ou...

متن کامل

Multimodal Attention for Neural Machine Translation

Journal: :CoRR 2016

Ozan Caglayan Loïc Barrault Fethi Bougares

The attention mechanism is an important part of the neural machine translation (NMT) where it was reported to produce richer source representation compared to fixed-length encoding sequence-to-sequence models. Recently, the effectiveness of attention has also been explored in the context of image captioning. In this work, we assess the feasibility of a multimodal attention mechanism that simult...

متن کامل

Contrastive Learning for Image Captioning

2017

Bo Dai Dahua Lin

Image captioning, a popular topic in computer vision, has achieved substantial progress in recent years. However, the distinctiveness of natural descriptions is often overlooked in previous work. It is closely related to the quality of captions, as distinctive captions are more likely to describe images with their unique aspects. In this work, we propose a new learning method, Contrastive Learn...

متن کامل

Captioning Transformer with Stacked Attention Modules

Journal: :Applied Sciences 2018

متن کامل

Fusion Models for Improved Image Captioning

Journal: :Lecture Notes in Computer Science 2021

Visual captioning aims to generate textual descriptions given images or videos. Traditionally, image models are trained on human annotated datasets such as Flickr30k and MS-COCO, which limited in size diversity. This limitation hinders the generalization capabilities of these while also rendering them liable making mistakes. Language can, however, be vast amounts freely available unlabelled dat...

متن کامل

Generalized Image Captioning for Multilingual Support

Journal: :Applied sciences 2023

Image captioning is a problem of viewing images and describing in language. This an important that can be solved by understanding the image, combining two fields image processing natural language into one. The purpose research so far has been to create general explanatory captions learning data. However, various environments reality must considered for practical use, as well descriptions suit u...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید