E-book: Multilingual Text Recognition: A Deep Learning Approach

Ruijie Yan, Liangrui Peng

Format: EPUB+DRM
Series: SpringerBriefs in Computer Science
Publication date: 01-Jan-2026
Publisher: Springer Nature Switzerland AG
Language: eng
ISBN-13: 9789819678983

Format - EPUB+DRM
Price: 55.56 €*
* The price is final, i.e. no further discounts apply.
This e-book is intended for personal use only. E-books cannot be returned.

DRM restrictions

Copying (copy/paste): not allowed
Printing: not allowed
Usage: digital rights management (DRM)
The publisher has issued this e-book in encrypted form, which means that you need to install special software to read it. You also need to create an Adobe ID. The e-book can be read by 1 user and downloaded to up to 6 devices (all authorized with the same Adobe ID).

Required software
To read on a mobile device (phone or tablet), you need to install this free app: PocketBook Reader (iOS / Android)

To read on a PC or Mac, you need to install Adobe Digital Editions (a free application designed specifically for reading e-books; it should not be confused with Adobe Reader, which is probably already installed on your computer).

This e-book cannot be read on an Amazon Kindle.

Multilingual text recognition is crucial for cross-language information acquisition and related applications in the mobile computing era. The core problem is to find efficient representation and decoding methods for multilingual text recognition, including scene text recognition and handwriting recognition tasks. This book introduces a novel deep learning framework termed Primitive Representation Learning for sequence modeling. In contrast to conventional approaches that employ either (1) convolutional neural networks (CNNs) combined with recurrent neural networks (RNNs) and connectionist temporal classification (CTC) for decoding, or (2) attention-based encoder-decoder architectures, the proposed framework offers an alternative paradigm for sequence representation and processing. Primitive representations are learned via global feature aggregation and then transformed into high-level visual text representations via a graph convolutional network, which enables parallel decoding for text transcription. A multielement attention mechanism and a temporal residual mechanism are further introduced to make better use of spatial and temporal feature information.

The methods presented in this book have been evaluated on public datasets and applied to scene text recognition and handwriting recognition systems. Readers will gain a better understanding of state-of-the-art methods and research findings in multilingual scene text recognition, handwriting recognition, and related fields. The only prerequisite is a basic knowledge of machine learning and deep learning.

Chapter 1 Introduction.
Chapter 2 Primitive Representation Learning.
Chapter 3 Multielement Attention Mechanism.
Chapter 4 Dynamic Temporal Residual Learning and Attention Rectification.
Chapter 5 TH-DL Multilingual Text Recognition System Framework.

Liangrui Peng is currently an associate professor at the Department of Electronic Engineering, Tsinghua University, Beijing, China. She received her Ph.D. degree in Information and Communication Engineering from Tsinghua University in 2010. Her research interests include multilingual text recognition and understanding, computer vision and machine learning. She has received the National Awards for Science and Technology Progress (Second Class) in China three times. Her recent research work with graduate students has advanced multilingual text recognition, receiving multiple awards including the DAS 2016 Best Paper Award, the ICDAR 2019 Best Student Paper Runner Up Award, and the DRR 2015 Best Student Paper Award.

Ruijie Yan received his B.Sc. degree in 2017 and his Ph.D. degree in 2022 from the Department of Electronic Engineering at Tsinghua University, Beijing, China. He is currently a senior applied scientist at Microsoft (China) Co. Ltd. His research interests include computer vision and machine learning. He has published several papers in venues such as CVPR and ECCV, and won the ICPR 2020 and ICDAR 2017 Arabic Video Text Detection and Recognition competitions.
