Re: [tor-dev] Sanitizing and publishing our web server logs

2011-12-01 Thread Runa A. Sandvik
On Tue, Oct 18, 2011 at 8:27 AM, Karsten Loesing wrote: > The webalizer output for www.torproject.org can be viewed here: > > http://freehaven.net/~karsten/volatile/www.torproject.org-webalizer/ I have looked into four different web log analysis tools, see https://trac.torproject.org/projects/tor

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-10-18 Thread Karsten Loesing
On 8/25/11 10:08 AM, Karsten Loesing wrote: > we have been discussing sanitizing and publishing our web server logs > for quite a while now. The idea is to remove all potentially sensitive > parts from the logs, publish them in monthly tarballs on the metrics > website, and analyze them for top vi

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-14 Thread Andrew Lewman
On Friday, September 02, 2011 10:08:37 Brian Szymanski wrote: > What exactly are we hoping to gain from the analysis of the (hopefully > correctly) stripped logs? Overall, all of our data collected can be analyzed to see if any of it can be used to discover users, sets of users, or other personal

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-14 Thread Brian Szymanski
What exactly are we hoping to gain from the analysis of the (hopefully correctly) stripped logs? On 09/02/2011 09:06 AM, Sebastian Hahn wrote: On Sep 2, 2011, at 2:46 PM, Karsten Loesing wrote: Hi Andrew, On 9/2/11 2:18 AM, Andrew Lewman wrote: On Thursday, August 25, 2011 04:08:00 Karsten

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-05 Thread Karsten Loesing
On 9/2/11 7:32 PM, Marsh Ray wrote: > On 08/25/2011 03:08 AM, Karsten Loesing wrote: >> Hi everyone, >> >> we have been discussing sanitizing and publishing our web server logs >> for quite a while now. The idea is to remove all potentially sensitive >> parts from the logs, publish them in monthly

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-05 Thread Karsten Loesing
On 9/2/11 3:06 PM, Sebastian Hahn wrote: > > On Sep 2, 2011, at 2:46 PM, Karsten Loesing wrote: > >> Hi Andrew, >> >> On 9/2/11 2:18 AM, Andrew Lewman wrote: >>> On Thursday, August 25, 2011 04:08:00 Karsten Loesing wrote: we have been discussing sanitizing and publishing our web server logs

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-02 Thread Marsh Ray
On 08/25/2011 03:08 AM, Karsten Loesing wrote: Hi everyone, we have been discussing sanitizing and publishing our web server logs for quite a while now. The idea is to remove all potentially sensitive parts from the logs, publish them in monthly tarballs on the metrics website, and analyze them

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-02 Thread Sebastian Hahn
On Sep 2, 2011, at 2:46 PM, Karsten Loesing wrote: > Hi Andrew, > > On 9/2/11 2:18 AM, Andrew Lewman wrote: >> On Thursday, August 25, 2011 04:08:00 Karsten Loesing wrote: >>> we have been discussing sanitizing and publishing our web server logs >>> for quite a while now. The idea is to remove

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-02 Thread Karsten Loesing
Hi Andrew, On 9/2/11 2:18 AM, Andrew Lewman wrote: > On Thursday, August 25, 2011 04:08:00 Karsten Loesing wrote: >> we have been discussing sanitizing and publishing our web server logs >> for quite a while now. The idea is to remove all potentially sensitive >> parts from the logs, publish them

Re: [tor-dev] Sanitizing and publishing our web server logs

2011-09-01 Thread Andrew Lewman
On Thursday, August 25, 2011 04:08:00 Karsten Loesing wrote: > we have been discussing sanitizing and publishing our web server logs > for quite a while now. The idea is to remove all potentially sensitive > parts from the logs, publish them in monthly tarballs on the metrics > website, and analyz

[tor-dev] Sanitizing and publishing our web server logs

2011-08-25 Thread Karsten Loesing
Hi everyone, we have been discussing sanitizing and publishing our web server logs for quite a while now. The idea is to remove all potentially sensitive parts from the logs, publish them in monthly tarballs on the metrics website, and analyze them for top visited pages, top downloaded packages,