Stuck on simple search

rald · August 24, 2023, 12:48pm

Two things I’d like to add to this old discussion:

1. I think there is a core to the frustration that has not yet been formulated here: if one wants a very quick and dirty, broad and coarse query, one does not want to write a very detailed and specific query.
In line with that, Google search, for example, works by “the more precisely you know what you want, the more instructions you add”, whereas the current implementation in TBX follows the logic of “the fuzzier you want your search, the more instructions you have to add”.
I think the problem with the latter approach is that opening up a query is a lot less straightforward than narrowing it down: see how complex the query @pat wrote is, and compare that with the effort of narrowing down a Google search to an exact phrase using " ".
2. A toggleable “Exact Phrase” option is how, for example, the Preview app in macOS deals with this issue. It is hidden behind the little dropdown arrow next to the search icon.

Pro: You get to choose gears.
Con: You might forget what gear you’ve set.
Failing to notice that search does not behave the way you expect is a non-trivial problem. See the users reporting that they had always assumed their input would be treated with an implicit AND operator.
Resolution: A visual indicator showing which search mode is active. Imagine, for example, quotation marks around the loupe icon.

Curious to hear if there have been changes regarding this; from fiddling around and quickly searching the documentation, it did not seem like it to me. Finding an elegant way to navigate these issues, especially my point 1., seems like a very tinderboxy thing to me, which is why I thought digging out this old thread was worth it.

eastgate · August 24, 2023, 1:15pm

As it is, to search for a note with two words in it, I have to create action code with four actions, three boolean operators, and two sets of parens.

No. To search for a note that contains Vancouver and also contains doctors, ⌘-F Vancouver&doctors.

I’m willing to think about this again. But (hint hint) the way to convince me is to show at least one of the following:

Your alternative approach makes possible things that aren’t possible at present.
Your alternative approach makes things easy which are now possible only with difficulty, and the things it makes easy are important things to lots of Tinderbox users.
Your approach makes some things easier and some things more difficult, but you can present hard evidence that the things it makes easier are more important or more common.
You can’t present hard evidence, but you can show how hard evidence could be obtained.

eastgate · August 24, 2023, 2:28pm

Let’s think about the Google query Vancouver doctors in the Tinderbox context. (The Google context, all the text in the world, isn’t a lot like the Tinderbox context.)

1. In a small document (say, notes on The 34th Intl. Conference On Time Travel) neither term will be very common. If you’re trying to find your notes on that interesting keynote Thursday about Dr. Who and the Vancouver Grizzlies, searching for either term will find what you’re looking for.
2. In a big document about a specific topic (your notes for a doctoral dissertation on Emergency Room Medicine In The 19th Century: Direct Observations) one of the terms may be mere confirmation. Hundreds of your notes will mention “doctor” and only a few will mention “Vancouver”. So, search for Vancouver and then do a text search (or visual scan) to skip over your receipts for that dinner at Joe Fortes’s.
3. In a big, unfocused document (notes on all the books you’ve read in the past decade) it might be worth searching “Vancouver|doctor” because neither term is very common. (Quick: have you read a mystery set in Vancouver? Have you read two? What’s the last mystery you read in which a doctor was prominent?)

These all contemplate a core Tinderbox task: having accumulated hundreds or thousands of notes on a topic, you want to locate a half-remembered note. That chore is the reason ⌘-F exists.

In a large and active Tinderbox document, we might have an agent that gathers notes on a topic of particular interest. Perhaps we have a daily review of the most recent notes on this topic. This agent works best if it is precise, that is, if it doesn’t list many notes that aren’t relevant. For example, we might want to highlight the most recent notes about “Vancouver doctors” even though there are lots of receipts for tasty Vancouver restaurants, and also lots of receipts from Doctor Whatsis, your psychotherapist in Sheboygan. In this case, we really might want all the notes that mention Vancouver and also mention doctors. The agent query $Text.contains("Vancouver") & $Text.icontains("doctor") explains precisely what you’re looking for.

Incidentally, I think functions help encapsulate queries in an interesting way. If we want fairly complicated logic for a query, it may be better to write a simple function wantsDailyReview(var:string theNote) than to mess around with nested conditionals.
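
To make the encapsulation idea concrete, here is a minimal sketch in Python rather than Tinderbox action code. It assumes that .contains() is a case-sensitive regex match and .icontains() a case-insensitive one; the function name wants_daily_review and the sample note texts are made up for illustration.

```python
import re


def wants_daily_review(note_text):
    """Rough Python mirror of the agent query
    $Text.contains("Vancouver") & $Text.icontains("doctor")."""
    # Assumption: .contains() behaves like a case-sensitive regex search.
    mentions_vancouver = re.search("Vancouver", note_text) is not None
    # Assumption: .icontains() behaves like a case-insensitive regex search.
    mentions_doctor = re.search("doctor", note_text, re.IGNORECASE) is not None
    return mentions_vancouver and mentions_doctor


# Hypothetical note texts, loosely based on the examples in this thread.
notes = [
    "Saw two doctors at the Vancouver clinic.",
    "Receipt: dinner in Vancouver.",
    "Doctor Whatsis, Sheboygan.",
    "DOCTOR on call in Vancouver.",
]

# Only notes that mention both terms survive the filter.
review = [text for text in notes if wants_daily_review(text)]
print(review)
```

The point is only that the condition lives in one named place, so the agent query itself stays short however complicated the logic becomes.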

rald · August 24, 2023, 4:28pm

Thanks for elaborating, @eastgate! This lines up well with what I suspected people fall back on: just querying for single words to keep the query broad.

However, I think there is quite a case to be made that “the Google way” could make things significantly easier.
Google is pretty smart, because it processes your query adaptively, depending on the results. It essentially does automatically what you’d do manually in your 3 cases.
Going through the cases you supplied in reverse order works well to illustrate that:

Big unfocused document (3.): This is most like an everyday web search using Google. There are lots of hits for both terms, and thus Google will show you pages containing both terms. (I am unsure, but I think I remember finding out that it also prioritizes results where the word order matches.)
Big document about specific topic (2.): No difference here; with the Google way you also have to discard search terms that give you too many results.
Small document (1.): This resembles a web search on a very exotic topic, where there is not much content to be found. In this case, Google will adapt and start showing you results that contain only one or the other search term.

So to wrap up the benefit of the google way, as I see it:

You get the results you want for both case 1. and case 3. using the same, dead simple query and just one Find.
The benefit is actually biggest for the case where you thought there definitely was something for “Vancouver&doctor”, but it turns out there are results neither for “Vancouver” nor for “doctor”. With the current Find you’d end up running 3 queries, whereas Google would directly shift into searching for either of the terms and tell you there is nothing for “Vancouver&doctor”, “Vancouver” or “doctor”.
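
As a rough illustration of that fallback behaviour, here is a sketch in Python rather than Tinderbox action code. The function name adaptive_find and the note data are hypothetical, and this is not a description of how Find or Google actually work; it only shows "try all terms first, then relax to any term".

```python
def adaptive_find(notes, terms):
    """Try an implicit AND first; if nothing matches, fall back to OR.

    `notes` maps a note name to its text. Returns (mode, matching names).
    """
    lowered = [term.lower() for term in terms]

    def hits(combine):
        # `combine` is the builtin all (AND) or any (OR).
        return [name for name, text in notes.items()
                if combine(term in text.lower() for term in lowered)]

    both = hits(all)
    if both:
        return "all terms", both
    either = hits(any)
    if either:
        return "any term", either
    return "no match", []


# Hypothetical notes, loosely based on the examples in this thread.
notes = {
    "Keynote": "Dr. Who and the Vancouver Grizzlies",
    "Receipt": "Dinner in Vancouver",
    "Therapy": "Session with the doctor in Sheboygan",
}

print(adaptive_find(notes, ["Vancouver", "Grizzlies"]))  # one note has both terms
print(adaptive_find(notes, ["Vancouver", "doctor"]))     # no note has both, so OR
print(adaptive_find(notes, ["Seattle", "nurse"]))        # nothing at all
```

Returning the mode alongside the hits matters here: it is what would let the interface tell you that it quietly widened the search.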

I would not be surprised, however, if you can come up with an example where the manual approach allows you to do something one can’t do the Google way.

mwra · August 24, 2023, 4:45pm

But the view pane find (Cmd+F) does that by default. If you type Library of Congress in the box, the matches are to the exact phrase, not to instances of any individual word. IOW, congress on its own does not match.

Noting the defaults of the two options in the Find bar’s pop-up menu:

case sensitive (off by default)
regex (on by default)

…then typing Library of Congress in the find box is the same as the agent query:

$Text.icontains("Library of Congress") | $Name.icontains("Library of Congress")

(you can optionally pick to also OR-include a single user attribute as the source, or use any one or two of those three possible sources)

So, to my understanding, Find view already defaults to the Google notion that the quoted term must be in the answer. TBH though, Google doesn’t play by that rule and offers you other irrelevant matches without the term; I suspect this occurs if the algo worries it has too few matches.

So I’m unclear as to what problem you are solving. What do you mean by a “broad and coarse query”? One that you can’t define but which gives the desired result? I don’t mean the last in a snarky sense, but search result effectiveness can be highly subjective, and poor results are often as much down to a lack of correct user input as to the search itself. I generally find Google needs more prodding to get close to answer links worth clicking, partly as its indexing tends to default to too wide a match. Also, is there a difference, in your experience of the app, between using Find with single words as opposed to phrases? If so, in what way? It seems the OP’s point was about searching for a list of words (implicit: also phrases) co-occurring in a searched attribute, but it seems you want something different.

satikusala · August 25, 2023, 10:51am

Hi there, I’m curious, what am I doing wrong? When I hit Enter with “Vancouver&doctors”, the search result dialog does not pop up. If I do one or the other word it does.

eastgate · August 25, 2023, 1:26pm

You’re right. I invented some regex syntax that doesn’t exist.

mwra · August 25, 2023, 2:16pm

Yes, Find only searches for one term, across a minimum of one and a maximum of three attributes. The search carried out against each attribute is the same as the query:

$Attr.icontains("My Search Term")

Of the two drop-down options, ticking ‘case sensitive’ changes the query to:

$Attr.contains("My Search Term")

Dropping the regex option makes the query, without case sensitivity:

$Attr ==  "My Search Term".lowercase();

or no regex with case sensitivity:

$Attr == "My Search Term";

It may not be what people want, but hopefully gives a better handle about what the Find box search is doing.
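
For anyone who finds the four combinations easier to read as code, here is a rough Python sketch of the behaviour described above. It is an illustration, not Tinderbox's actual implementation: the function name find_matches is made up, and in the non-regex, case-insensitive branch it folds both sides to lowercase, which is an assumption about the intent.

```python
import re


def find_matches(value, term, case_sensitive=False, regex=True):
    """Sketch of the four Find modes for one attribute value.

    Defaults mirror the Find bar defaults: case sensitive off, regex on.
    """
    if regex:
        # Regex on: the term may match anywhere inside the value.
        flags = 0 if case_sensitive else re.IGNORECASE
        return re.search(term, value, flags) is not None
    if case_sensitive:
        # Regex off: the whole value must equal the term exactly.
        return value == term
    # Assumption: case-insensitive equality folds both sides to lowercase.
    return value.lower() == term.lower()


# Default mode: a case-insensitive phrase match inside a longer value.
print(find_matches("The Library of Congress", "library of congress"))
# A single word from the phrase does not satisfy the whole phrase.
print(find_matches("Congress", "Library of Congress"))
# Without regex, the value must be the term, not merely contain it.
print(find_matches("The Library of Congress", "Library of Congress", regex=False))
# "&" has no special meaning in a regex, so it is matched literally.
print(find_matches("Vancouver doctors", "Vancouver&doctors"))
```

The last call is the earlier “Vancouver&doctors” case: treated as a regex, the ampersand is just a literal character, so a note containing both words separately does not match.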

satikusala · August 26, 2023, 4:42pm

Gotcha, a multi-term find or a “near” operation could be pretty cool. Perhaps we bring the discussion to the backstage, including the idea of filtering search to a specific container.