forked from tensorflow/models
-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ocr #1
Open
aidinkrmz
wants to merge
67
commits into
theraysmith:master
Choose a base branch
from
0xqq:master
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
ocr #1
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Corrected "One this" to "Once this" and added comma for proper punctuation on line 702.
This reverts commit c6a4f78. Fixed tensorflow/tensorflow#4981
Revert "Use open() instead of tf.gfile.FastGFile()"
Fix run textsum on a single GPU
Fix broken link in inception readme
added python3 support to read_label_file
For English News Corpus, [Ling et al. (2015)](http://www.cs.cmu.edu/~lingwang/papers/emnlp2015.pdf)'s score is 97.78 -> 97.44 (lower than SyntaxNet and Parsey Mcparseface) according to [Andor et al. (2016)](http://arxiv.org/abs/1603.06042).
Fix POS tagging score of Ling et al.(2005)
"threads" declared twice, so delete one
Add Inception-ResNet-v2 pre-trained model
Fix comment of parameter "output_codes"
Update README.md
Update README.md
Fix end point collection to return a dict
Allow softplacement for ResNet
tf-model-slim doc typo
Explicitly set state_is_tuple=False.
Differential privacy analysis for the privacy model tutorial
privacy-preserving model
Added STREET model for FSNS dataset
Consolidate privacy/ and differential_privacy/.
Fix the BUILD file
Now differential_privacy and privacy are under the same project.
Remove privacy/ after consolidation.
val_captions_file -> captions_val2014.json
Remove comment that TensorFlow must be built from source.
Implementation of Inception V4 (tensorflow#18)
Update compression model README with results for comparison.
Adding list of maintainers Changing model links to point to tensorflow/models repository.
Updating README.md
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hello
I'm a software engineering student and i use tesseract OCR engine in a university project. For persian language, traineddata which it's a file and it made by Training tesseract 4.00 and LSTM method, has a good result and output in Arial fonts but it doesn't have any good result in some specific fonts for persian. So the questions are :
1- did you use specific fonts like B Nazanin , B Roya or etc in Training Tesseract 4.00 with LSTM or not?
2- if they haven't used how can we use these fonts for getting better result?
I prepared a text that all the cases of litrates have repeated for 10 or 15 or more than 15 times in this text. Also i used the process of training tesseract 3.05 for this text but i didn't get better and beneficial output.
For achieving to a good result in persian in Tesseract OCR engine we need your experience and your help.
Thanks for your attention
Sincerely.