What is the bates number? - pdf

I am trying to learn more about PDF and the programmatic methods. So I read many related blogs of some companies, like ComPDFKit.
There is no much more information about bates number. Could anyone just tell me what the bates number is, and how to use it?

Related

What's bates number

I am trying to learn more about PDF and the programmatic methods. So I read many related blogs of some companies, like ComPDFKit.
There is no much more information about bates number. Could anyone just tell me what the bates number is, and how to use it?
I will thank you a lot, If you know the method in code.

Need a suitable scripting language for below requirement of PDF documents

We have project requirement to validate PDF files which would contain below things for different policies.
Page Number
Images (screen shots)
Here we want to validate whether all the pages have images(screen shots), number of images in the PDF, image duplication and empty pages.
Please suggest me a suitable scripting language and way to fulfill our requirement.
Note:- Each policy will have different set screen shots and hence the total no of pages and image content for each PDF will vary.
Thanks in Advance!
I've had to validate a lot of PDFs and found this toolkit very useful http://euske.github.io/pdfminer/index.html . It's written in Python, but comes with an excellent pdfdump utility which lets you look at the page number of each pages and all the elements in that page.
Having said that, I've only used it for text and am not sure how it identifies images.
I would comment on Kim Ryan's answer, except that I don't have enough reputation to comment yet, which seems pretty silly.
In any case, I agree with Kim that pdfminer is probably your best bet overall. However, I would mention that looking for images isn't all that difficult, and there is an "extract" example in the pdfrw library that will find images and pull them out to a separate PDF file. I don't think it would be very hard to modify it to match images to page numbers. I am the pdfrw author, so you can email me (address at github) if you have any questions on this.

Rating system for multiple models in Application

I need suggestion for rating system implementation. I have Blogs, Discussions, Comments in my system, so each member can make +1 to current blog post or discussion entry, comment etc. I know that there is acts_as_ratable, but this is no what I need.
Any suggestions, what can I use to keep my code DRY?
P.S - I know it can be handled using polymorphic associations, but I'm searching for basic solution to integrate.
Thanks in advance!
This is a perfect case for Rails' polymorphic associations -- several different types are "ratable".
Good doc on this here:
http://guides.rubyonrails.org/association_basics.html#polymorphic-associations
And great screencast here:
http://railscasts.com/episodes/154-polymorphic-association

Sampling on Yahoo! Answers

I wonder what is the best way to sample,say, 1000 questions,completely randomly from Yahoo! Answer.
I want to achieve this complete randomness in which I will totally ignore the categories or date of posting etc.
Doing this manually may result in bias,so could anyone give some suggestions here,like using Yahoo! Answer API or sth.
Thanks a lot.
I do not know if it is correct solution from a formal point of view but I would use yahoo boss search to retrieve 4000 questions, and than randomly pick up 1000. Using a search engine let you to retrieve the most important (highly ranked/linked) questions. You can play around with queries for the search engine to get questions of all kinds - most popular and the worst ones... There is also Yahoo Answer API, which provide search functionality but I have not used it so I can not say how good it is.

References on best practices for registration keys/access codes format

I am developing an online site to which access will be sold at college bookstores. Students will purchase a card at the bookstore with an access code that they may then use to register online at our site.
I want to make the code as user friendly as possible. I personally hate registering for a product and having to type in a registration key 5 times because it's ambiguous.
Can anyone point me to resources describing best practices for designing the format of the code itself? Obvious things spring to mind-- don't use zeroes or the letter O, don't make it case sensitive, include some kind of checksum. I don't want to be creative here, I need a recipe for what must be a problem solved many times.
Joel Spolsky had some good insights to solving this problem in one of the recent StackOverflow podcasts. I believe the episode was #49, you should download podcasts or checkout the transcripts at https://stackoverflow.fogbugz.com/default.asp?W4
Dealing with 0 (number) and O (letter) mixed in a key is really annoying as some fonts make it hard to distinguish the two.
Other usability concepts such as groups of three being easier to deal with and remember then a single number are good to be aware of. For example, 345-829-817-432 instead of 345829817432.
By the way, 345-829-817-432 gives you 12^10 permutations, and even the smaller number 345-829-817 gives you 9^10 permutations which may give you enough strength depending on your situation.
tschüss,
bn
It really depends on how much security you need. A few ideas come to mind.
If you want something really simple you could generate simulated credit card numbers; students are adept at using these four-digit combinations, and they can be checked with a Luhn algorithm.
If you want something a little stronger you could generate a GUID, and use that as the code.
If your website can send emails you can ask the student for their email address, and send them a challenge/response email. Then you don't need codes at all. Their email address is the code.