Re: Request to Join

2021-08-31 Thread Andrew M.A. Cater
On Tue, Aug 31, 2021 at 08:05:34AM +0200, Rob Le wrote:
> Hello there, hope all is well.
> 
>  
> 
> My team and I are interested in getting involved in the local Debian 
> community.  We live and work in the San Francisco community and looking 
> for way to be more supportive to our favorite open-source projects.
> 

Hi Rob,

That's good to hear. There's a flourishing Debian community in SF, I thnk
[I'm in UK] Try: https://wiki.debian.org/LocalGroups#CA_-_California

The community team is here primarily to make Debian a welcoming place and 
to see that people work well together on mailing lists and IRC.

If you're developers and new to Debian, you might want the Debian mentors
list to find out how Debian packages things. Otherwise, there's debian-user
and debian-devel which is primarily a list for established Debian developers
/ raising technical queries to the developer community as a whole.

As ever, it's sometimes worth looking over the list archives to find
out how stuff works - https://www.debian.org/MailingLists/ should give you
everything you need to know.

A couple of small things - please turn off HTML if you're able to and 
try to avoid top posting when replying to lists.

All the very best, as ever,

Andy Cater

[For the community team].

>  
> 
> Please let us know anyway we can be of service.
> 
>  
> 
> Thanks,
> 
>  
> 
> Rob Wingate



Re: Request to Join

2021-08-31 Thread Paul Wise
On Tue, Aug 31, 2021 at 9:29 AM Andrew M.A. Cater wrote:

> If you're developers and new to Debian, you might want the Debian mentors
> list to find out how Debian packages things.

In addition to packaging, there are many other ways to help Debian,
both technical and non-technical, most of them are listed here:

https://www.debian.org/intro/help

-- 
bye,
pabs

https://wiki.debian.org/PaulWise



Re: Q: Use https for {deb,security}.debian.org by default

2021-08-31 Thread Helmut Grohne
TL;DR: Any encrypted transport protocol (e.g. https) breaks caching.

On Sat, Aug 21, 2021 at 10:40:32AM +0200, Wouter Verhelst wrote:
> This could be useful for both the "I've got a slow uplink and would like
> it to not be overwhelmed at the BSP I'm hosting for my Debian friends"
> type as well as the "I'm an ISP and I want to provide a mirror to Debian
> users so we can reduce our uplink connection a bit" type of situations.

Yes, please. Another situation is: "My home houses around 10 Debian
machines, some of which are mobile."

I'm actually doing this since around 10 years.

My first implementation (predating both auto-apt-proxy and
squid-deb-proxy-client) was a custom DHCP option containing a proxy.
I've locally used that with success for quite a while in my own
environments. It does have a few downsides:
 * Integrating it with isc-dhcp-client was impossible in a
   policy-compliant way.
 * IPv4 only.
 * As others have pointed out: Breaks with https.

Other commenters mentiond squid-deb-proxy-client using
_apt_proxy._tcp. via avahi.

auto-apt-proxy has a few techniques beyond that.

Unfortunately, none of them are enabled by default, so in general it
doesn't just work.

In any case, I for one would actively opt out of https, because it would
make apt unusable for me. My cache has a byte hit rate around 90% - 95%.
It seems to deliver around 4TB of .debs and stuff per year here. How
would you get that through DSL? Using https would make apt unbearably
slow for me.

Honouring _apt_proxy._tcp. by default would be cool indeed.
apt-cacher-ng automatically publishes this as does squid-deb-proxy.
However doing so would completely break if we were to go https by
default.

Helmut



Bug#993386: ITP: sqlite-fts4 -- Document scoring Python library for SQLite FTS4

2021-08-31 Thread Edward Betts
Package: wnpp
Severity: wishlist
Owner: Edward Betts 
X-Debbugs-Cc: debian-devel@lists.debian.org, debian-pyt...@lists.debian.org

* Package name: sqlite-fts4
  Version : 1.0.1
  Upstream Author : Simon Willison
* URL : https://github.com/simonw/sqlite-fts4
* License : Apache 2.0
  Programming Lang: Python
  Description : Document scoring Python library for SQLite FTS4

Custom SQLite functions written in Python for ranking documents indexed
using the FTS4 extension.

## rank_score()

This is an extremely simple ranking function, based on an example in the
SQLite documentation. It generates a score for each document using the
sum of the score for each column. The score for each column is
calculated as the number of search matches in that column divided by the
number of search matches for every column in the index - a classic
TF-IDF calculation.

## rank_bm25()

An implementation of the Okapi BM25 scoring algorithm.

## decode_matchinfo()

SQLite's built-in matchinfo() function returns results as a binary
string. This binary represents a list of 32 bit unsigned integers, but
reading the binary results is not particularly human-friendly.

## annotate_matchinfo()

This function decodes the matchinfo document into a verbose JSON
structure that describes exactly what each of the returned integers
actually means.

Blog post about the creation of this library:
https://simonwillison.net/2019/Jan/7/exploring-search-relevance-algorithms-sqlite/

This is a dependency of the sqlite-utils tool by the same author.

I plan to maintain this package as part of the python modules team.