Re: find file containing text inside the file

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance




pacinitaly;1173379 Wrote:
SAnton;1173259 Wrote:
Imagine the folowing situation: I downloaded .zip-archive with thouzands
of .php-files. I unpacked the file to the folder at my Desktop (which is
indexed by default). I want to find all files containing string «Test».
So, I added «Plain text filter» to the .php files extension in the
indexing settings. Should I wait hours until the indexing will be done?
Or I can just search (more slowly)? I tryed and did not get any results!
So, I will wait. It's very annoying!

Another case: I want to find files containing Russian word
«Привет». The problem is that that
word can be encoded by different bytes depending on codepage (most
popular are cp1251, UTF-8, KOI8-R). I tested text search and found that
if BOM present at the start of the file, file considered UTF-8,
otherwise file considered cp1251. But how about UTF-8 files which do not
have BOM mark at the beginning? When I program PHP I always remove BOM,
because othervise these BOMs are sended to the user-agent by server.
Notepad application still detects UTF-8 in such files, but Search is not
(see attached files). Also, how about KOI8? I didn't find any way to
specify encoding in search options.

Third case: how about different word forms in Russian? «One day» =
«Один день», «Three
days» = «Три дня», «Five days» =
«Пять дней», «All these
days» = «Все эти
дни»,... How I will find
«день» («day») with all possible forms?

It's a pity that Windows text search is completely useless. :(> >

love the post!!!

try 'Super Finder 1.5.3.0 Free Download'
(http://freemedia.realgage.com/super-finder-1530.html)

I have tryed Super Finder. It fast, perform real content search. That
is good.

First I tryed it on my folder with 300 PHP-files (don't worry, these
files are English-only). It found 3 files compared to the 1 file found
by Vista using the same search query. That is very strange: it looks
like Vista Search is approximate and unreliable. Imagine that your boss
required you to find all files containing "bucks" and replace these by
"dollars". Failed to perform this task may result in serious
consequences for you.

The bad thing is that Russian text was found only in CP1251-encoded
text files. UTF-8 files (with and without BOM) are ignored. I tryed to
check «unicode» checkbox in the Super Finder, but in that case I got
zero results. This is worse than Vista search, which at least can
recognise UTF-8 by BOM.


--
SAnton
.



Relevant Pages

  • Re: Re: find file containing text inside the file
    ... Imagine the folowing situation: ... if BOM present at the start of the file, file considered UTF-8, ... the Desktop folder is *not* indexed by default. ...
    (microsoft.public.windows.vista.file_management)
  • Re: find file containing text inside the file
    ... if BOM present at the start of the file, file considered UTF-8, ... First I tryed it on my folder with 300 PHP-files (don't worry, ...
    (microsoft.public.windows.vista.file_management)
  • Re: Trouble importing foreign language accents into Access 2003
    ... I see no BOM at the top of either file. ... verify for the presence or the absence of a UTF-8 BOM (Byte ... Sylvain Lafontaine, ing. ... MVP - Windows Live Platform ...
    (microsoft.public.access.externaldata)
  • Re: aps.net : BIG BUG in streamwriter
    ... look the BOM! ... editor which proceeds to rewrite it as UTF-16? ... when i want deserialize it with an utf-8 encoding... ...
    (microsoft.public.dotnet.languages.csharp)
  • Re: Custom Resource, XML problem
    ... Why are you assuming that it is 8-bit characters? ... //JWxml is namespace used by CXml ... which is then screamingly obvious as the UTF-8 Byte Order Mark, ... BOM is the only meaning of BOM in my brain was for "Bill Of Material" which ...
    (microsoft.public.vc.mfc)