視覚センサのシーン画像におけるテキスト認識のためのテキスト位置補正とエンコーダ-デコーダネットワークに基づくアルゴリズム【JST・京大機械翻訳】

Huang Zhiwei; Huang Zhiwei; Lin Jinzhao; Yang Hongzhi; Wang Huiqian; Bai Tong; Liu Qinghui; Pang Yu

文献

J-GLOBAL ID：202102283181975438 整理番号：21A2529545

視覚センサのシーン画像におけるテキスト認識のためのテキスト位置補正とエンコーダ-デコーダネットワークに基づくアルゴリズム【JST・京大機械翻訳】

An Algorithm Based on Text Position Correction and Encoder-Decoder Network for Text Recognition in the Scene Image of Visual Sensors

出版者サイト複写サービスで全文入手 {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=21A2529545&COPY=1") }}
高度な検索・分析はJDreamⅢで {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=21A2529545&from=J-GLOBAL&jstjournalNo=U7015A") }}

著者 (8件)： , , , , , , ,
資料名：
巻： 20 号： 10 ページ： 2942 発行年： 2020年
JST資料番号： U7015A ISSN： 1424-8220 CODEN： SENSC9 資料種別：逐次刊行物 (A)
記事区分：原著論文発行国：スイス (CHE) 言語：英語 (EN)

自然シーン画像におけるテキスト認識は,文書画像関連視覚センサの分野で常に最新の話題である。以前の文献は,水平テキスト認識の問題をほとんど解決するが,自然場面におけるテキストは通常,傾斜し,不規則であり,多くの未解決問題がある。このため,テキスト位置補正(TPC)モジュールと符号器デコーダネットワーク(EDN)モジュールに基づくシーンテキスト認識アルゴリズムを提案した。最初に,傾斜テキストをTPCモジュールを通して水平テキストに修正して,次に,水平テキストの内容をEDNモジュールを通して正確に同定した。標準データセットに関する実験は,アルゴリズムが多くの種類の不規則なテキストを認識して,より良い結果を得ることができることを示した。アブレーション研究は,提案した2つのネットワークモジュールが不規則なシーンテキスト認識の精度を強化することができることを示した。Copyright 2021 The Author(s) All rights reserved. Translated from English into Japanese by JST.【JST・京大機械翻訳】

, , , , ,
, , , 【Automatic Indexing@JST】

著者キーワード (4件)： , , ,

パターン認識 , 図形・画像処理一般

引用文献 (39件)：

Zhu, L.; Shen, J.; Xie, L. Unsupervised topic hypergraph hashing for efficient mobile image retrieval. IEEE Trans. Cybern. 2017, 47, 3941-3954.
Piras, L.; Giacinto, G. Information fusion in content based image retrieval: A comprehensive overview. Inf. Fusion 2017, 37, 50-60.
Bulan, O.; Kozitsky, V.; Ramesh, P. Segmentation-and annotation-free license plate recognition with deep localization and failure identification. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2351-2363.
Islam, K.T.; Raj, R.G.; Mujtaba, G. Recognition of traffic sign based on bag-of-words and artificial neural network. Symmetry 2017, 9, 138.
Wang, X.J.; Zhang, L.; Ma, W.Y. Text to Image Translation. US Patent 9678992B2, 13 June 2017.

, , , , ,

前のページに戻る