Spam detection software, running on the system "avery.infomotions.com", has identified this incoming email as possible spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see eric_morgan_at_infomotions.com for details. Content preview: Solr itself has an internal limit to the number of results you can return on a single page (I think it is 1000) and AFAIK always returns a paged result. For speed and memory usage over large result sets it would probably be most efficient to build in paging logic. [...] Content analysis details: (1.3 points, 1.1 required) pts rule name description ---- ---------------------- -------------------------------------------------- 1.0 RCVD_IN_SORBS_SPAM RBL: SORBS: sender is a spam source [74.125.82.43 listed in dnsbl.sorbs.net] 2.7 DNS_FROM_AHBL_RHSBL RBL: Envelope sender listed in dnsbl.ahbl.org [listed in lists.clir.org.rhsbl.ahbl.org. IN] [A] 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (andrew.hankinson[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record -0.6 RP_MATCHES_RCVD Envelope sender domain matches handover relay domain 0.0 T_HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail domains are different -0.0 SPF_HELO_PASS SPF: HELO matches SPF record 0.0 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail domains are different 0.0 DKIM_ADSP_CUSTOM_MED No valid author signature, adsp_override is CUSTOM_MED -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] 0.0 FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom freemail headers are different 0.0 T_FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom freemail headers are different
attached mail follows:
Solr itself has an internal limit to the number of results you can return on a single page (I think it is 1000) and AFAIK always returns a paged result. For speed and memory usage over large result sets it would probably be most efficient to build in paging logic. > On Aug 31, 2016, at 10:45 PM, Tod Olson <tod_at_UCHICAGO.EDU> wrote: > > On a related note, do any of the libraries allow the user to iterate over a large result set without having to be aware of repeated calls, incrementing the start parameter, and that sort of bookkeeping? > > It seems like someone must have built an iterator to hide that when you're trying to sift through a large number of hits. > > -Tod > >> On Aug 31, 2016, at 4:09 PM, Rhoads, Joseph <joseph_rhoads_at_BROWN.EDU> wrote: >> >> I've used several of these. I like the interface of mysolr but (as >> mentioned) it hasn't been updated in a while. >> >> pysolr is fairly up to date (v3.5 came out in May this year), and is used >> in django-haystack for the solr backend. >> https://github.com/django-haystack/pysolr >> >> Haystack itself is great if you want an ORM-like interface for solr and use >> django. >> https://github.com/django-haystack/django-haystack >> >> -Joseph >> >> >> >>> On Wed, Aug 31, 2016 at 3:42 PM, Chris Gray <cpgray_at_uwaterloo.ca> wrote: >>> >>> I haven't done much of that but you can submit documents via the API and >>> have them indexed (and processed by Tika). Once you understand how to do >>> that, you might find that you can do everything you want to do. >>> >>> An alternative would be reading the source of one of those libraries. In >>> the list you referenced, the only mention of inserting documents was for >>> sunburnt. I would be inclined to look there first, especially since it >>> mentions a pythonic interface to Solr. >>> >>> A good, and amusing, cautionary tale about overwritten Python libraries is >>> at https://www.youtube.com/watch?v=o9pEzgHorH0. >>> >>> Chris >>> >>> >>>> On 2016-08-31 03:28 PM, Eric Lease Morgan wrote: >>>> >>>> On Aug 31, 2016, at 3:25 PM, Chris Gray <cpgray_at_UWATERLOO.CA> wrote: >>>> >>>> Okay, there are SO many Python libraries [1] for Solr, and I’d like to >>>>>> know which one is the most popular (not necessarily the “best”). >>>>> What do you want to do with it? >>>>> >>>>> I didn't feel the need to even look for a Python library for my needs. >>>>> I use Python to submit searches to the Solr web API and consume the results >>>>> as JSON. >>>> >>>> Good question. I want to add documents to a Solr index, and I want to >>>> query the same index. Hmmm… —Eric M. >Received on Thu Sep 01 2016 - 04:59:20 EDT