نتایج جستجو برای: captioning order

تعداد نتایج: 908879  

Journal: :CoRR 2017
Xin Wang Wenhu Chen Jiawei Wu Yuan-Fang Wang William Yang Wang

Video captioning is the task of automatically generating a textual description of the actions in a video. Although previous work (e.g. sequence-to-sequence model) has shown promising results in abstracting a coarse description of a short video, it is still very challenging to caption a video containing multiple fine-grained actions with a detailed description. This paper aims to address the cha...

Journal: :CoRR 2017
Satoshi Tsutsui David J. Crandall

Recent work in computer vision has yielded impressive results in automatically describing images with natural language. Most of these systems generate captions in a single language, requiring multiple language-specific models to build a multilingual captioning system. We propose a very simple technique to build a single unified model across languages, using artificial tokens to control the lang...

2015
Jack Hessel Nicolas Savva Michael J. Wilber

We examine the possibility that recent promising results in automatic caption generation are due primarily to language models. By varying image representation quality produced by a convolutional neural network, we find that a state-of-theart neural captioning algorithm is able to produce quality captions even when provided with surprisingly poor image representations. We replicate this result i...

2017
Mert Kilickaya Aykut Erdem Nazli Ikizler-Cinbis Erkut Erdem

The task of generating natural language descriptions from images has received a lot of attention in recent years. Consequently, it is becoming increasingly important to evaluate such image captioning approaches in an automatic manner. In this paper, we provide an in-depth evaluation of the existing image captioning metrics through a series of carefully designed experiments. Moreover, we explore...

2017
Kuang-Yu Chang Kung-Hung Lu Chu-Song Chen

As mentioned in our paper, we study a new problem, aesthetic critiques generation, which is different from conventional image captioning. Many recent works [1, 2] started to argued that conventional evaluation criteria (BLEU, METEOR and CIDEr) borrowed from machine translation community are unsuitable for image captioning task. How to choose a suitable criterion is still a tricky problem in ima...

2018
Jiasen Lu Jianwei Yang Dhruv Batra Devi Parikh

We introduce a novel framework for image captioning that can produce natural language explicitly grounded in entities that object detectors find in the image. Our approach reconciles classical slot filling approaches (that are generally better grounded in images) with modern neural captioning approaches (that are generally more natural sounding and accurate). Our approach first generates a sent...

2018
Xihui Liu Hongsheng Li Jing Shao Dapeng Chen Xiaogang Wang

The aim of image captioning is to generate similar captions by machine as human do to describe image contents. Despite many efforts, generating discriminative captions for images remains non-trivial. Most traditional approaches imitate the language structure patterns, thus tend to fall into a stereotype of replicating frequent phrases or sentences and neglect unique aspects of each image. In th...

Journal: :CoRR 2017
Ali Sharifi Boroujerdi Maryam Khanian Michael Breuß

With recent innovations in dense image captioning, it is now possible to describe every object of the scene with a caption while objects are determined by bounding boxes. However, interpretation of such an output is not trivial due to the existence of many overlapping bounding boxes. Furthermore, in current captioning frameworks, the user is not able to involve personal preferences to exclude o...

Journal: :CoRR 2016
Karan Sharma Arun C. S. Kumar Suchendra M. Bhandarkar

Significant performance gains in deep learning coupled with the exponential growth of image and video data on the Internet have resulted in the recent emergence of automated image captioning systems. Ensuring scalability of automated image captioning systems with respect to the ever increasing volume of image and video data is a significant challenge. This paper provides a valuable insight in t...

Journal: :Advances in intelligent systems and computing 2021

With the huge expansion of Internet and trillions gigabytes data generated every single day, needs for development various tools have become mandatory in order to maintain system adaptability rapid changes. One these is known as image captioning. Every entity must be properly identified managed, therefore case data, automatic captioning identification required. Similarly, content generation mis...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید