[SPAM 1.3] Re: python for solr

From: Andrew Hankinson <andrew.hankinson_at_nyob>
Date: Wed, 31 Aug 2016 21:00:31 +0100
To: CODE4LIB_at_LISTS.CLIR.ORG
Spam detection software, running on the system "avery.infomotions.com",
has identified this incoming email as possible spam.  The original
message has been attached to this so you can view it or label
similar future email.  If you have any questions, see
eric_morgan_at_infomotions.com for details.

Content preview:  Of all of them, the only one I've found that is python 3 compatible
   is pysolr : https://github.com/django-haystack/pysolr It has been pretty
  solid, actively developed, and the devs are pretty responsive to pull requests.
   [...] 

Content analysis details:   (1.3 points, 1.1 required)

 pts rule name              description
---- ---------------------- --------------------------------------------------
 2.7 DNS_FROM_AHBL_RHSBL    RBL: Envelope sender listed in dnsbl.ahbl.org
                            [listed in lists.clir.org.rhsbl.ahbl.org.	IN]
                            [A]
 1.0 RCVD_IN_SORBS_SPAM     RBL: SORBS: sender is a spam source
                            [74.125.82.53 listed in dnsbl.sorbs.net]
 0.0 DKIM_ADSP_CUSTOM_MED   No valid author signature, adsp_override is
                            CUSTOM_MED
 0.0 T_HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                            domains are different
-0.0 SPF_PASS               SPF: sender matches SPF record
 0.0 FREEMAIL_FROM          Sender email is commonly abused enduser mail provider
                            (andrew.hankinson[at]gmail.com)
-0.6 RP_MATCHES_RCVD        Envelope sender domain matches handover relay domain
 0.0 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                            domains are different
-0.0 SPF_HELO_PASS          SPF: HELO matches SPF record
-1.9 BAYES_00               BODY: Bayes spam probability is 0 to 1%
                            [score: 0.0000]
 0.0 FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom
                             freemail headers are different
 0.0 T_FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and
                            EnvelopeFrom freemail headers are different



attached mail follows:


Of all of them, the only one I've found that is python 3 compatible is pysolr : https://github.com/django-haystack/pysolr

It has been pretty solid, actively developed, and the devs are pretty responsive to pull requests.

> On Aug 31, 2016, at 8:42 PM, Chris Gray <cpgray_at_UWATERLOO.CA> wrote:
> 
> I haven't done much of that but you can submit documents via the API and have them indexed (and processed by Tika).  Once you understand how to do that, you might find that you can do everything you want to do.
> 
> An alternative would be reading the source of one of those libraries.  In the list you referenced, the only mention of inserting documents was for sunburnt.  I would be inclined to look there first, especially since it mentions a pythonic interface to Solr.
> 
> A good, and amusing, cautionary tale about overwritten Python libraries is at https://www.youtube.com/watch?v=o9pEzgHorH0.
> 
> Chris
> 
>> On 2016-08-31 03:28 PM, Eric Lease Morgan wrote:
>> On Aug 31, 2016, at 3:25 PM, Chris Gray <cpgray_at_UWATERLOO.CA> wrote:
>> 
>>>> Okay, there are SO many Python libraries [1] for Solr, and I’d like to know which one is the most popular (not necessarily the “best”).
>>> What do you want to do with it?
>>> 
>>> I didn't feel the need to even look for a Python library for my needs.  I use Python to submit searches to the Solr web API and consume the results as JSON.
>> 
>> Good question. I want to add documents to a Solr index, and I want to query the same index. Hmmm… —Eric M.
Received on Wed Aug 31 2016 - 16:00:39 EDT