Hi all, We now have most of the Indian central and state gazettes archived at https://archive.org/details/gazetteofindia?sort=-date
There are crawlers running daily out of the code at the egazette <https://github.com/sushant354/egazette> repo and my temporary fork <https://github.com/ramSeraph/egazette> of the same. One of the advantages of having the data at archive.org is that it comes with automatic OCR(using tesseract), a free text search engine and a possibility to get a RSS feed based on a search query. I hope people build some useful things with it. The following states and union territories currently have problems: 1. *Andaman and Nicobar islands*: Site doesn't have current data. 2. *Jammu and Kashmir*: Site is offline. Hopefully temporarily. 3. *Mizoram:* Data is not being updated at source. 4. *Meghalaya:* Data delayed by 3 months 5. *West Bengal*: No gazette site could be found. Would appreciate it if anyone can locate it( https://www.wbgazettepart2.in/ is not it ). Thanks, Sreeram K -- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion visit https://groups.google.com/d/msgid/datameet/CAMgvHC5sttm0hoajbFySGRRVHUmHKM2d3e-_NtmpooSUxAd1OQ%40mail.gmail.com.
