"Full Text Search" algorithms?

Discussion in 'Computer Science & Culture' started by jynx_d5, Sep 6, 2002.

Thread Status:
Not open for further replies.
  1. jynx_d5 Registered Member

    Messages:
    7
    Hi all of you geeks out there,

    I'm need an efficient way to search a large amount of English text (plain text) for a given word (or set of words). I am searching for a code snippet or an algorithm. I found none. I couldn't even find any material to describe such methods or algorithms.

    If anyone can provide a link to some site or anything, that would be of great help.

    Please Register or Log in to view the hidden image!

     
  2. Google AdSense Guest Advertisement



    to hide all adverts.
  3. Adam §Þ@ç€ MØnk€¥ Registered Senior Member

    Messages:
    7,415
    For what operating system, and what type of project?
     
  4. Google AdSense Guest Advertisement



    to hide all adverts.
  5. jynx_d5 Registered Member

    Messages:
    7
    It don't matter. My OS is WinXP. I write code in VC++, VB, C#, you name it! I just need an idea better than comparing sequentially every word in the text to the word(s) I'm trying to locate. You know, something like indexing or ...? Thanx

    Please Register or Log in to view the hidden image!

     
  6. Google AdSense Guest Advertisement



    to hide all adverts.
  7. Adam §Þ@ç€ MØnk€¥ Registered Senior Member

    Messages:
    7,415
    I was just thinking that Grep probably does most of what you want to do. Grep and WC and a few other Unix tools.
     
  8. jynx_d5 Registered Member

    Messages:
    7
    I'll have a look at those. Although, I don't think Grep is much of use. I think it does the same simple search!
     
  9. sjmarsha Registered Senior Member

    Messages:
    363
    No. Adam is right. grep is the best way. You can search multiple documents for any text string you like.
     
  10. prozak Banned Banned

    Messages:
    782
    Are you asking for an algorithm, or an existing piece of code that does this?
     
  11. jynx_d5 Registered Member

    Messages:
    7
    Thanx prozak for understanding me and not suggesting a utitlity.

    I'd love to find an algorithm. But, a source code will be great too.

    Thanx again.

    Please Register or Log in to view the hidden image!

     
  12. prozak Banned Banned

    Messages:
    782
    Cool, maybe i'm useful in some limited context. Here's what a quick search found:

    There are several search utilities/engines with source code on this page:

    http://maya.cs.depaul.edu/~classes/ds575/tools.html

    And from that, this looks like the most complete example of the whole nine yards:

    http://maya.cs.depaul.edu/~classes/ds575/software/waistool/source/ir/

    I'm a little bit fried from looking at computer screens right now, but a quick glance suggests someone with more diligence than I will find something at one of the two sources above... sorry this is vague. With more energy I will do more tomorrow if needed and possible.
     
  13. jynx_d5 Registered Member

    Messages:
    7
    Maybe...? Man, you made my day. You found me a treasure. I can't tell you how much grateful I am. I owe you big time.
     
  14. sjmarsha Registered Senior Member

    Messages:
    363
    most programming languages have a grep facility, why not use that?

    Its easier.

    (Unless you are trying to pass off someone elses algorithm as your own of course...)
     
  15. jynx_d5 Registered Member

    Messages:
    7
    I am very much offended. Thank you.

    Please Register or Log in to view the hidden image!

     
Thread Status:
Not open for further replies.

Share This Page