Welcome to Squishdot Websites Squishdot How-To Newbies
 about
 search
 post article
 Documentation
 Mailing Lists
 Bug Tracking
 Development
 Installation
 Upgrading
 Download
 admin
 rdf

 main


Are Chinese characters searchable?
Squishdot Posted by Silas Lin on Wednesday December 29, 02:19AM, 1999
from the i-wish-zope-would-work-with-all-languages dept.
Are Chinese characters searchable?
中文可以搜尋得到嗎?
In the last test, Only author can be found.

Squishdot is my favorite news generator. I found it's very cool, and every thing is OK with Chinese characters, except the search function.

I thought it's the problem of Zope because Confera has the same problem. I do know the real problem is the incompatibility of 2-byte code. But I don't know how to solve it. Can anyone help to solve this problem?

<  |  >

 

Related Links
  • Articles on Squishdot
  • Also by Silas Lin
  • Contact author
  • The Fine Print: The following comments are owned by whoever posted them.
    ( Reply )

    Re: Are Chinese characters searchable?
    by Butch Landingin on Wednesday December 29, 03:19PM, 1999
    Unfortunately, not at this current time -- this is due to the indexing classes (including the new ZCatalog) not being "internationalized" yet.

    Michel Pelletier (of Digital Creations, the makers of Zope) is working to make ZCatalog support an "internationalized" version. However, it may still be up to the Zope community/volunteers to create one for their own languages...

    This means that Squishdot may not be able to do searches in other languages until:

    1. ZCatalog is "internationalized" and "local(e)ized"

    2. Squishdot is modified to use ZCatalog.

    There are also other internationalization issues that haven't been resolved -- for one, Python's support for Unicode -- which will hopefully be addressed with the next Python release...

    Nevertheless, things are moving forward :^).

    HTH.

    Butch
    [ Reply to this ]
    Re: Are Chinese characters searchable?
    by Brad Collins on Thursday December 30, 08:26AM, 1999
    I'll be looking into this for a project we have to do which will be in English, Japanese and Chinese.

    Drop me email if you don't see anything here in a week or so....
    [ Reply to this ]
    Re: Are Chinese characters searchable?
    by dfbb on Saturday January 01, 03:14AM, 2000
    You can setup another perl serach enginee
    to seach chinese chars, It is good enough
    try get it from woju.o3.net
    [ Reply to this ]
    • Re: Are Chinese characters searchable?
      by jon on Thursday January 06, 02:19AM, 2000
      I checked out woju.o3.net and couldn't find the
      app you mentioned. Perhaps you could give its
      name. Any more details would be greatly
      appreciated. Thanks. Jon

      [ Reply to this ]
      • Re: Are Chinese characters searchable?
        by sean on Thursday August 16, 08:03PM, 2001
        just insert the google search form on your site... actually it's august 2001 and i think ZCatalog supports Unicode (not completely, but...) so SQuishdot should be able to search Chinese. but i think it can't, i just tried to search this web site and it didn't work...
        [ Reply to this ]
    Re: Are Chinese characters searchable?
    by Pavlos on Wednesday January 12, 08:06PM, 2000
    Chinese characters are not so easy to index, because there are no word boundaries. Do you index by character? By a group of characters? In the end indexing might not be any more efficient than brute force searching, which is what we use on our current site:
    http://www.yifannet.com/Yifan/News/

    If you perform a text search it will iterate over all (3000) postings and check for the given string. It is actually quite fast considering we are having approx 40,000 requests per day
    [ Reply to this ]
    The Fine Print: The following comments are owned by whoever posted them.
    ( Reply )

    Powered by Zope  Squishdot Powered
      "Any system that depends on reliability is unreliable." -- Nogg's Postulate
    All trademarks and copyrights on this page are owned by their respective companies. Comments are owned by the Poster. The Rest ©1999 Butch Landingin, ©2000-2002 Chris Withers.