Riak Search. Indexing special chars. Internal Server Error.

Vladimir Shapovalov shapovalov at gmail.com
Mon Aug 20 10:43:08 EDT 2012


Hi Ryan,

I'm  using ver. 1.2.0 too. It happens only on bucket with indexing enabled.
I think it depends on the Analyzer factory.
My index schema was:
...
        {dynamic_field, [
            {name, "*"},
            {type, string},
            {analyzer_factory, {erlang, text_analyzers,  *
whitespace_analyzer_factory*}}
        ]}
...

Since I've changed the default factory from
whitespace_analyzer_factory to standard_analyzer_factory
it seems to work.
My current index schema is:
...
        {dynamic_field, [
            {name, "*"},
            {type, string},
            {analyzer_factory, {erlang, text_analyzers, *
standard_analyzer_factory*}}
        ]}
...

It looks like the whitespace_analyzer_factory has some differences compare
to standard_analyzer_factory.

Thanks
Vladimir

On Mon, Aug 20, 2012 at 3:50 PM, Ryan Zezeski <rzezeski at basho.com> wrote:

>
> On Mon, Aug 20, 2012 at 6:24 AM, Vladimir Shapovalov <shapovalov at gmail.com
> > wrote:
>
>> Hi all,
>>
>> I'm trying to put some data in a bucket, witch has indexing enabled.
>> One key value has a special char ß (geman S-sharp<http://en.wikipedia.org/wiki/%C3%9F>).
>> It is UTF-8 encoded.
>>
>> The errror message i get:
>>
>> ---------------------------------------------------------------------------------------------
>>
>> Internal Server ErrorThe server encountered an error while processing this request:
>>
>> {error,
>>     {error,badarg,
>>         [{erlang,iolist_to_binary,
>>              [{hook_crashed,
>>                   {riak_search_kv_hook,precommit,error,function_clause}}]},
>>          {wrq,append_to_response_body,2},
>>          {riak_kv_wm_object,handle_common_error,3},
>>          {webmachine_resource,resource_call,3},
>>          {webmachine_resource,do,3},
>>          {webmachine_decision_core,resource_call,1},
>>          {webmachine_decision_core,accept_helper,0},
>>          {webmachine_decision_core,decision,1}]}}
>>
>> ------------------------------
>> mochiweb+webmachine web server
>>
>>
>> ---------------------------------------------------------------------------------------------
>>
>> Could anyone tell me please, how to store data special chars included correctly?
>>
>> How to deal with non ASCII chars in Riak, say coming from Russian or German languages?
>>
>> And why it causes riak_search_kv_hook to crash?
>>
>>
>> Thanks in Advance!
>>
>> Vladimir
>>
>>
> Valdimir,
>
> I tried different combinations of things to reproduce your error: key
> containing ß, bucket containing ß, plain text value with ß, JSON with ß,
> etc.  Everything worked fine for me and I got the expected number of
> results.
>
> However, I tested on master (essentially version 1.2.0).  What version of
> Riak are you using?
>
> -Z
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.basho.com/pipermail/riak-users_lists.basho.com/attachments/20120820/5c7761e0/attachment.html>


More information about the riak-users mailing list