论文学习
Attention and language ensemble for scene text recognition

发布于44个月以前

  • 0
  • 0
  • 349

发布于44个月以前

Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling

Recent dominant approaches for scene text recognition are mainly based on convolutional neural network (CNN) and recurrent neural network (RNN), where the CNN processes images and the RNN generates character sequences. Different from these methods, we propose an attention-based architecture1 which is completely based on CNNs. The distinctive characteristics of our method include: (1) the method follows encoder-decoder architecture, in which the encoder is a two-dimensional residual CNN and the decoder is a deep one-dimensional CNN. (2) An attention module that captures visual cues, and a language module that models linguistic rules are designed equally in the decoder. Therefore the attention and language can be viewed as an ensemble to boost predictions jointly. (3) Instead of using a single loss from language aspect, multiple losses from attention and language are accumulated for training the networks in an end-to-end way. We conduct experiments on standard datasets for scene text recognition, including Street View Text, IIIT5K and ICDAR datasets. The experimental results show our CNN-based method has achieved state-of-the-art performance on several benchmark datasets, even without the use of RNN.

论文下载

论文地址:https://dl.acm.org/doi/pdf/10.1145/3240508.3240571

算法链接

算法
https://marketplace.huaweicloud.com/markets/aihub/modelhub/detail/?id=5686e7e3-19ee-4991-adcd-29fd73b4090b

算法指南

算法指南https://bbs.huaweicloud.com/forum/forum.php?mod=viewthread&tid=92736&page=1&extra=#pid537546

论文精读

评论 0

登录后评论

    spy

    作者相关内容

    “挑战杯”揭榜挂帅-华为云专项赛 疲劳/分神驾驶检测Baseline
    发布于23个月以前
    无人车大赛-baseline
    发布于33个月以前
    2022 船舶大赛ModelArts 操作指导
    发布于35个月以前
    AnimeGANv2动漫人脸实践
    发布于28个月以前
    在ModelArts上创建自己的算法-生成模型-在线推理(包括详细代码)
    发布于46个月以前

    暂无数据

    热门内容推荐

    快速体验10个精选论文复现算法模型,赢取蓝牙音箱、体脂秤、ModelArts图书!
    WAYNE 发布于44个月以前
    华为云AI论文精读会2021第二十期:轻量化神经网络MobileNet系列论文精读
    euuufe 发布于41个月以前
    [华为云AI经典论文复现] AI Gallery CrowdDet算法使用介绍
    euuufe 发布于42个月以前
    Fast-SCNN: Fast Semantic Segmentation Network
    spy 发布于44个月以前
    华为云AI论文精读会2021第四期:Dynamic RCNN:一种有效提升RCNN系列网络表现的动态训练方法
    euuufe 发布于42个月以前

    暂无数据