
How can long-text recognition be improved? #22

Open
2017ZYS opened this issue Feb 21, 2022 · 1 comment

Comments


2017ZYS commented Feb 21, 2022

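By default, training images enter the network as 3*48*160. I tried training on a Chinese dataset with 1*32*320 input and STRING_MAX_LEN set to 150 (long enough for most text lines). During testing, however, I found that when images are forcibly resized to 32*320, recognition of some long text lines is not very good. What approaches could improve this?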
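
To make the distortion concrete, here is a minimal sketch (not from this repository) of the arithmetic behind a forced resize versus an aspect-preserving resize with right padding. The names `forced_squeeze` and `fit_with_padding` are hypothetical, and padding is only one commonly tried option, not a fix confirmed by the maintainers.

```python
def forced_squeeze(src_h, src_w, dst_h=32, dst_w=320):
    """Return how much narrower characters become when an image of size
    (src_h, src_w) is forced to (dst_h, dst_w), ignoring aspect ratio.
    A value above 1 means horizontal compression relative to the height."""
    scale_h = dst_h / src_h
    scale_w = dst_w / src_w
    return scale_h / scale_w


def fit_with_padding(src_h, src_w, dst_h=32, dst_w=320):
    """Scale to height dst_h while keeping the aspect ratio.
    Returns (resized_w, pad_right). If the scaled width exceeds dst_w it is
    clamped to dst_w, so very long lines are still squeezed in that case."""
    resized_w = max(1, round(src_w * dst_h / src_h))
    resized_w = min(resized_w, dst_w)
    return resized_w, dst_w - resized_w


if __name__ == "__main__":
    # A long line that is 32 px high and 1280 px wide (illustrative numbers).
    print(forced_squeeze(32, 1280))
    # A short line that fits: scaled width plus padding on the right.
    print(fit_with_padding(64, 400))
```

Under these assumptions, a line wider than the target still gets compressed even with padding, so the target width itself (or splitting the line) may matter more than the resize method.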

@VixeruntR

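Has there been any progress on this? I train at 48*240; short text is recognized quite well, but long text comes out almost entirely wrong.
Sometimes I also see this kind of behavior: the image reads "年龄:15" ("Age: 15"), but the prediction is "年龄:1555555555555555555555555555555555555555555555555......", with the last character repeated many times.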
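
For triaging outputs like this, here is a minimal sketch (an assumption, not part of this repository) that flags predictions ending in an implausibly long run of a single character. The names `trailing_run_length` and `looks_runaway` and the threshold `max_run=8` are hypothetical; this only detects the symptom and does not address its cause.

```python
def trailing_run_length(text):
    """Length of the run of identical characters at the end of text."""
    if not text:
        return 0
    last = text[-1]
    run = 0
    for ch in reversed(text):
        if ch != last:
            break
        run += 1
    return run


def looks_runaway(text, max_run=8):
    """True if the prediction ends with at least max_run copies of one character.
    max_run is an arbitrary illustrative threshold, to be tuned per dataset."""
    return trailing_run_length(text) >= max_run


if __name__ == "__main__":
    print(looks_runaway("年龄:15"))
    print(looks_runaway("年龄:15" + "5" * 40))
```

Counting how many test predictions are flagged, grouped by image width, could show whether the repetition correlates with long inputs.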
