Hi all,

We now have most of the Indian central and state gazettes archived at
https://archive.org/details/gazetteofindia?sort=-date

There are crawlers running daily out of the code at the egazette
<https://github.com/sushant354/egazette> repo and my temporary fork
<https://github.com/ramSeraph/egazette> of the same.

One of the advantages of having the data at archive.org is that it comes
with automatic OCR(using tesseract), a free text search engine and a
possibility to get a RSS feed based on a search query. I hope people
build some useful things with it.

The following states and union territories currently have problems:

   1. *Andaman and Nicobar islands*: Site doesn't have current data.
   2. *Jammu and Kashmir*: Site is offline. Hopefully temporarily.
   3. *Mizoram:* Data is not being updated at source.
   4. *Meghalaya:* Data delayed by 3 months
   5. *West Bengal*: No gazette site could be found. Would appreciate it if
   anyone can locate it( https://www.wbgazettepart2.in/ is not it ).

Thanks,
Sreeram K

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion visit 
https://groups.google.com/d/msgid/datameet/CAMgvHC5sttm0hoajbFySGRRVHUmHKM2d3e-_NtmpooSUxAd1OQ%40mail.gmail.com.

Reply via email to