![]() |
KnowBrainer Speech Recognition | ![]() |
Topic Title: FindText Topic Summary: Really? Created On: 08/29/2022 10:31 PM Status: Post and Reply |
|
![]() |
![]() |
- ax | - 08/29/2022 10:31 PM |
![]() |
![]() |
- ax | - 08/29/2022 11:49 PM |
![]() |
![]() |
- Ag | - 08/30/2022 11:51 PM |
![]() |
![]() |
- ax | - 08/31/2022 02:45 AM |
![]() |
![]() |
- ax | - 08/31/2022 03:29 PM |
![]() |
|
I mean really! Have I not heard of Ctrl-F? Well every day one learns something and forgets a little more.
While I do try to keep Ctrl-F in mind, I am here to introduce an AHK ImageSearch refinement going by the intuitive name of "FindText": https://www.autohotkey.com/boards/viewtopic.php?t=17834, whose primary utility, I think, is to search text on a screen as images.
The inspiration to try this came from none other than the Chief Ideas Officer, Ag. See thread here please: https://www.knowbrainer.com/forums/forum/messageview.cfm?catid=4&threadid=36598
Most AHKaky type of things are community endeavours to various extent. This one is no exception. It's open source of course. Due to author's ESL status, introductions were written by other users!
The package is fairly functional as I just started playing with it in the last hour. 90% of the menu I still don't quite understand. I will in the next little while try to digest some of the tutorials. They are hard to get a grasp on without going hands-on.
But it's packaged in such a way that not a lot of "hand-holding" is needed just to get started. Let's say I am looking for the "word" "DMO:" as seen in the following screenshot of this forum:
NB: above is not the search field/haystack. My search haystack is the entirety of my 3 monitors. It's that kind of speed refinement that makes this utility useful. I will set a box 30 pixel by 12 pixel and go on cropping my "text", or needle, rather, to allow for search later in the haystack.
And one would then get something like below, after mandatory "ASCII-artification", so-to-speak, which converts the cropped image to "black and white" (I just click the "GrayDiff2Two" button ... will read the "manual" later). I haven't a clue the ins and outs here.
Now if I go click the "Test" button above, to search for this ASCII art "DMO:" across the whole of my 3 screens (WUXGA x2 + UXGA x1), it takes a mere average of 130ms to find it.
13ms would be better. But anything under 300ms I think is relatively acceptable from a "UX" point of view.
And that's with a feeble Quadro M620 in my 5-year-old Z2 Mini G3. I kinda was hoping there might be justification to upgrade to something more modern, like an RTX A2000, before certain straight boils over and computer and parts go the way of natural gas ... I mean, a nerd may not have to shower, but no computers? ¯\_(`;`)_/¯ |
|
|
|
![]() |
|
Now I just quickly tested over RDP on my office rig, into which I shoved a T400 gpu a few months ago to better handle the triple monitors. And it actually takes longer to find the "word" there, at around 230ms.
@Ag, I could see the utility being at least partially useful to your desired workflow, which I must say seems quite "high-end" (your workflow, that is). |
|
|
|
![]() |
|
Looks very cool! or perhaps I should say sounds very cool, because I haven't looked at the code yet.... OK, now it looks cool.
"The window containing the "needle" has to be visible for search to be successful. Can't be behind other windows." Yeah, this limitation is why didn't use the macOS version for very long: it worked really well as an alternative to "number buttons" and "click button number", but it wasn't really all that good for automating. Especially with modern apps where the things that are on the screen, the menus, etc., depend very much on screen size. but I suppose I'll need to play with It this weekend thanks! ------------------------- DPG15.6 (also DPI 15.3) + KB, Sennheiser MB Pro 1 UC ML, BTD 800 dongle, Windows 10 Pro, MS Surface Book 3, Intel Core i7-1065G7 CPU @ 1.3/1.5GHz (4 cores, 8 logical, GPU=NVIDIA Quadro RTX 3000 with Max-Q Design. |
|
|
|
![]() |
|
Due to unforeseen circumstances, I actually got a bit of unscheduled "free time" on my hands this week. So I have been going through the tutorials as we speak in the last 2 hours. |
|
|
|
![]() |
|
https://www.autohotkey.com/boards/viewtopic.php?f=7&t=102806
P.S., Very preliminary field testing showed that the search can in fact handle small amount of shading, which I suppose makes sense as long as there is enough of a "gradient" between characters and background (again colour inversion requires a whole new set of "needles").
Going with the "OCR-styled" search, I do find that the initial "definition" of the AlphaNumerics to be quite tedious and time-consuming. But once set up, the speed of searching even long strings is simply mind-bogglingly fast, at least for the menu items I tend to deal through Citrix.
There have been some false +ves and false -ves, e.g. "ff" and "i" in the middle of words (in Verdana 14 font), hyphens, and underscores. Generally I have been able to work around. |
|
|
FuseTalk Standard Edition v4.0 - © 1999-2023 FuseTalk™ Inc. All rights reserved.