On Fri, 8 Aug 2025, Jeremy Drake via Cygwin wrote:

> On Fri, 8 Aug 2025, Brian Inglis via Cygwin wrote:
>
> > On 2025-08-08 00:53, Thomas Wolff via Cygwin wrote:
> > > Am 08.08.2025 um 02:31 schrieb Jeremy Drake via Cygwin:
> > >> On a case-insensitive but case-preserving filesystem, is there a Cygwin
> > >> API to get the on-disk case for a given path?  It seems like `realpath`
> > >> ought to do it but running
> > >> $ touch case-test
> > >> $ realpath CASE-TEST
> > >> returns CASE-TEST.
> > > On the command line, you could use
> > > ls | grep -i
> > >
> > >> Regardless, canonicalize_file_name or realpath may not
> > >> be what I want because it would dereference symlinks.
> > >>
> > >> Background: I'm trying to debug some test failures in Clang, due to a
> > >> warning that's supposed to be issued when you #include "foo.h" but the
> > >> file on disk that it opened is "Foo.h".
> >
> > Looks like if you use wildcards, it should work correctly:
> >
> > $ lsattr -dl .
> > .                            ---
> > $ l *_exit*
> > _Exit.2  _exit.3
> > $ l _exit.?
> > _Exit.2  _exit.3
> > $ l _exit.[23]
> > _Exit.2  _exit.3
> > $ l *EXIT-TEST*
> > exit-test
> > $ l *exit*
> > _Exit.2  _exit.3  EXIT  exit-test
> >
> > also, you could just opendir(3)/readdir(3)/closedir(3) and strcasecmp(3).
> >
>
> Yeah, globbing is opendir/readdir/closedir, but it'd have to be done on
> each path component to recover the on-disk case for a path.  I'll explain
> now that I have a better idea what clang is doing under the hood.
>
> llvm has an API that opens a file with an out parameter for the "real"
> path.  On Windows, it uses GetFinalPathNameByHandleW
> (FILE_NAME_NORMALIZED|VOLUME_NAME_DOS), with some massaging for UNC.  On
> Unix, it first would prefer fcntl with F_GETPATH.  It doesn't look like
> Cygwin provides that.  If that's not available, available, it tries
> readlink on /proc/self/fd/%d.  If that's not available, it falls back to
> realpath on the name input.  I had a breakpoint on realpath that was not
> hit, so it appears that readlink is what it's doing.
>


I came up with this hack that seems to work.  It's pretty stupid, but
maybe a start of a discussion?  Maybe it'd make more sense to factor out
the GetFinalPathNameByHandleW-handling code from symlink_info::check and
call that to convert it to a posix path?

diff --git a/winsup/cygwin/fhandler/base.cc b/winsup/cygwin/fhandler/base.cc
index 5321ad7ff1..cbdd20d743 100644
--- a/winsup/cygwin/fhandler/base.cc
+++ b/winsup/cygwin/fhandler/base.cc
@@ -29,6 +29,7 @@ details. */
 #include "shared_info.h"
 #include <asm/socket.h>
 #include "cygwait.h"
+#include "tls_pbuf.h"

 static const int CHUNK_SIZE = 1024; /* Used for crlf conversions */

@@ -133,6 +134,27 @@ char *fhandler_base::get_proc_fd_name (char *buf)
       stpcpy (stpcpy (buf, get_name ()), " (deleted)");
       return buf;
     }
+  if (get_device () == FH_FS && get_name ())
+    {
+      tmp_pathbuf tp;
+      PWCHAR fpbuf = tp.w_get ();
+      DWORD ret;
+
+      ret = GetFinalPathNameByHandleW (get_handle (), fpbuf, NT_MAX_PATH, 0);
+      if (ret)
+       {
+         PWCHAR ubuf = tp.w_get ();
+         UNICODE_STRING uc = {2, 2, ubuf + sys_mbstowcs (ubuf, NT_MAX_PATH, 
get_name ())},
+                        fc = {2, 2, fpbuf + ret + 1};
+         while (--uc.Buffer >= ubuf && --fc.Buffer >= fpbuf &&
+             (RtlCompareUnicodeString (&uc, &fc, TRUE) == 0 ||
+              (iswdirsep (*uc.Buffer) && iswdirsep (*fc.Buffer))))
+           if (!iswdirsep (*uc.Buffer))
+             *uc.Buffer = *fc.Buffer;
+         sys_wcstombs (buf, NT_MAX_PATH, ubuf);
+         return buf;
+       }
+    }
   if (get_name ())
     return strcpy (buf, get_name ());
   if (dev ().name ())

-- 
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple

Reply via email to