[SPAM 1.3] Re: python for solr

From: Rhoads, Joseph <joseph_rhoads_at_nyob>
Date: Wed, 31 Aug 2016 16:09:22 -0400
To: CODE4LIB_at_LISTS.CLIR.ORG
Spam detection software, running on the system "avery.infomotions.com",
has identified this incoming email as possible spam.  The original
message has been attached to this so you can view it or label
similar future email.  If you have any questions, see
eric_morgan_at_infomotions.com for details.

Content preview:  I've used several of these. I like the interface of mysolr
   but (as mentioned) it hasn't been updated in a while. pysolr is fairly up
   to date (v3.5 came out in May this year), and is used in django-haystack
  for the solr backend. https://github.com/django-haystack/pysolr [...] 

Content analysis details:   (1.3 points, 1.1 required)

 pts rule name              description
---- ---------------------- --------------------------------------------------
 2.7 DNS_FROM_AHBL_RHSBL    RBL: Envelope sender listed in dnsbl.ahbl.org
                            [listed in lists.clir.org.rhsbl.ahbl.org.	IN]
                            [A]
 1.0 RCVD_IN_SORBS_SPAM     RBL: SORBS: sender is a spam source
                            [209.85.213.174 listed in dnsbl.sorbs.net]
 0.0 T_HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                            domains are different
-0.0 SPF_PASS               SPF: sender matches SPF record
-0.6 RP_MATCHES_RCVD        Envelope sender domain matches handover relay domain
 0.0 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                            domains are different
-0.0 SPF_HELO_PASS          SPF: HELO matches SPF record
-1.9 BAYES_00               BODY: Bayes spam probability is 0 to 1%
                            [score: 0.0000]



attached mail follows:


I've used several of these.  I like the interface of mysolr but (as
mentioned) it hasn't been updated in a while.

pysolr is fairly up to date (v3.5 came out in May this year), and is used
in django-haystack for the solr backend.
https://github.com/django-haystack/pysolr

Haystack itself is great if you want an ORM-like interface for solr and use
django.
https://github.com/django-haystack/django-haystack

-Joseph



On Wed, Aug 31, 2016 at 3:42 PM, Chris Gray <cpgray_at_uwaterloo.ca> wrote:

> I haven't done much of that but you can submit documents via the API and
> have them indexed (and processed by Tika).  Once you understand how to do
> that, you might find that you can do everything you want to do.
>
> An alternative would be reading the source of one of those libraries.  In
> the list you referenced, the only mention of inserting documents was for
> sunburnt.  I would be inclined to look there first, especially since it
> mentions a pythonic interface to Solr.
>
> A good, and amusing, cautionary tale about overwritten Python libraries is
> at https://www.youtube.com/watch?v=o9pEzgHorH0.
>
> Chris
>
>
> On 2016-08-31 03:28 PM, Eric Lease Morgan wrote:
>
>> On Aug 31, 2016, at 3:25 PM, Chris Gray <cpgray_at_UWATERLOO.CA> wrote:
>>
>> Okay, there are SO many Python libraries [1] for Solr, and I’d like to
>>>> know which one is the most popular (not necessarily the “best”).
>>>>
>>> What do you want to do with it?
>>>
>>> I didn't feel the need to even look for a Python library for my needs.
>>> I use Python to submit searches to the Solr web API and consume the results
>>> as JSON.
>>>
>>
>> Good question. I want to add documents to a Solr index, and I want to
>> query the same index. Hmmm… —Eric M.
>>
>
Received on Wed Aug 31 2016 - 16:09:47 EDT