Transformerを含むネットワークによる4Kリアルタイム画像変換

柴崎圭; 福崎翔太; 池原雅章

文献

J-GLOBAL ID：202202264878863946 整理番号：22A1209340

Transformerを含むネットワークによる4Kリアルタイム画像変換

4K real time image to image translation network including Transformer

出版者サイト {{ this.onShowPLink() }} 複写サービスで全文入手 {{ this.onShowCLink("http://jdream3.com/copy/?sid=JGLOBAL&noSystem=1&documentNoArray=22A1209340&COPY=1") }}
高度な検索・分析はJDreamⅢで {{ this.onShowJLink("http://jdream3.com/lp/jglobal/index.html?docNo=22A1209340&from=J-GLOBAL&jstjournalNo=U2030A") }}

著者 (3件)： , ,
資料名：
巻： 121 号： 420(IMQ2021 10-69) ページ： 209-215 (WEB ONLY) 発行年： 2022年03月02日
JST資料番号： U2030A ISSN： 2432-6380 資料種別：会議録 (C)
記事区分：短報発行国：日本 (JPN) 言語：日本語 (JA)

近年,Transformerをコンピュータビジョンに応用したネットワークが注目を集めており,優れた結果を残しているが,計算量やメモリの使用量が欠点でもある.そこで,本論文ではImage to Image TranslationのネットワークであるLaplacian Pyramid Translation Transformer(LPTT)を提案する.LPTTはラプラシアンピラミッドを作成することで計算量やメモリの使用量を抑えつつTransformerの表現力を得ており,従来手法と比べて優れた結果を残している.LPTTはTransformerを含むネットワークで4Kほどの高解像度画像に対してリアルタイム推論が行える初めてのネットワークである.また,LPTTは条件によっては8K画像もリアルタイムで推論できる.また,本論文では,高解像度の画像を処理する場合でもTransformerに低解像度の成分を計算させるだけで性能を上げることができるということを示唆している.(著者抄録)

, , , , , , , , , ,
, , ,

自然語処理 , ニューロコンピュータ

引用文献 (21件)：

P. Isola, J.-Y. Zhu, T. Zhou, and A.A. Efros, “Image-to-image translation with conditional adversarial networks,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.●●-●●, July 2017.
T.-C. Wang, M.-Y. Liu, J.-Y. Zhu, A. Tao, J. Kautz, and B. Catanzaro, “High-resolution image synthesis and semantic manipulation with conditional gans,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.●●-●●, June 2018.
J.-Y. Zhu, T. Park, P. Isola, and A.A. Efros, “Unpaired image-to-image translation using cycle-consistent adversarial networks,” Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp.●●-●●, Oct. 2017.
H.-Y. Lee, H.-Y. Tseng, J.-B. Huang, M. Singh, and M.-H. Yang, “Diverse image-to-image translation via disentangled representations,” Proceedings of the European Conference on Computer Vision (ECCV), pp.●●-●●, Sept. 2018.
Y. Li, M.-Y. Liu, X. Li, M.-H. Yang, and J. Kautz, “A closed-form solution to photorealistic image stylization,” Proceedings of the European Conference on Computer Vision (ECCV), pp.●●-●●, Sept. 2018.

前のページに戻る