Hi Hui, I have used the source_url function in the devtools package with good results. Give it a shot!
Best, Jorge.- On Tue, Mar 10, 2015 at 9:39 AM, Hui Du <hui...@savvyrookies.com> wrote: > Hi All, > > I am trying to parse some information from website, say, a linkedin page. > The linkedin url was > > url = "http://www.linkedin.com/in/huidu" > > I had no problem to use readLines and XML package to collect the > information I need. However, that url became " > https://www.linkedin.com/in/huidu" now. > > url = "https://www.linkedin.com/in/huidu" > > It failed readLines function. > > > readLines(url) > Error in file(con, "r") : cannot open the connection > In addition: Warning message: > In file(con, "r") : unsupported URL scheme > > > Do you know any way to read-in web information if the url is https? Thanks > a lot. > > Hui > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.