<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>On/Off &#187; ocr</title>
	<atom:link href="http://blog.yoavfarhi.com/tag/ocr/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.yoavfarhi.com</link>
	<description>Yoav Farhi&#039;s blog</description>
	<lastBuildDate>Sun, 05 Feb 2012 04:36:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='blog.yoavfarhi.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>On/Off &#187; ocr</title>
		<link>http://blog.yoavfarhi.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.yoavfarhi.com/osd.xml" title="On/Off" />
	<atom:link rel='hub' href='http://blog.yoavfarhi.com/?pushpress=hub'/>
		<item>
		<title>Using Google for OCR</title>
		<link>http://blog.yoavfarhi.com/2008/11/01/using-google-for-ocr/</link>
		<comments>http://blog.yoavfarhi.com/2008/11/01/using-google-for-ocr/#comments</comments>
		<pubDate>Sat, 01 Nov 2008 02:52:10 +0000</pubDate>
		<dc:creator>Yoav</dc:creator>
				<category><![CDATA[Random]]></category>
		<category><![CDATA[gmail]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[pdf to text]]></category>

		<guid isPermaLink="false">http://blog.yoavfarhi.com/?p=145</guid>
		<description><![CDATA[Amit Agarwal has posted a tip on his blog about using Google to convert PDF to text. For some reason, he suggests putting all your PDF documents on the web: Create a folder in your website (say abc.com/pdf) and upload all the PDF images to that folder. Now create a public web page that links [...]]]></description>
			<content:encoded><![CDATA[<p>Amit Agarwal has posted a tip on his blog about using <a href="http://www.labnol.org/software/convert-scanned-pdf-images-to-text-with-google-ocr/5158/">Google to convert PDF to text</a>. For some reason, he suggests putting all your PDF documents on the web:</p>
<blockquote><p>Create a folder in your website (say abc.com/pdf) and upload all the PDF images to that folder. Now create a public web page that links to all the PDF files. Wait for the Google bots to spider your stuff.</p>
<p>Once done, type the query &#8220;site:abc.com/pdf filetype:pdf&#8221; to see the PDF documents as HTML.</p></blockquote>
<p>Why would you want your documents to be accessible by anyone? Why wait for Google to index your page?</p>
<p>There&#8217;s a much easier way that I&#8217;ve been using, and one of the commenters on Agarwal&#8217;s blog points it out:</p>
<blockquote><p>You can upload the Scanned PDFs to Gmail and sent it you only. Then Open your Inbox and the mail sent from you, you have an option to View as HTML. That will solve the Hosting problem.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blog.yoavfarhi.com/2008/11/01/using-google-for-ocr/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fe9a6432e7e9d541ce8fe9574b1637ca?s=96&#38;d=http%3A%2F%2Fs0.wp.com%2Fi%2Fmu.gif&#38;r=G" medium="image">
			<media:title type="html">Yoav</media:title>
		</media:content>
	</item>
	</channel>
</rss>
