teemperor requested changes to this revision.
teemperor added inline comments.
================
Comment at:
lldb/packages/Python/lldbsuite/test/functionalities/data-formatter/data-formatter-stl/libcxx/string/main.cpp:8
+// A corrupt libcxx string which points to garbage and has a crazy length.
+static unsigned char garbage_string_long[] = {185, 52, 168, 29, 1, 0, 0, 0,
168, 61, 175, 29, 1, 0, 0, 0, 104, 222, 174, 29, 1, 0, 0, 0};
+
----------------
I think those byte arrays need a quick comment about which elements mean what
(or how they trigger the respective code paths). Just pointing out which bytes
are supposed to overwrite which `std::string` members is good enough. Something
like a macro maybe? `#define STD_STRING_BYTES(cap, size, length) {cap, size,
length}`
================
Comment at:
lldb/packages/Python/lldbsuite/test/functionalities/data-formatter/data-formatter-stl/libcxx/string/main.cpp:29
+ if (sizeof(std::string) == sizeof(garbage_string_sso))
+ memcpy((void *)&garbage1, &garbage_string_sso, sizeof(std::string));
+ if (sizeof(std::string) == sizeof(garbage_string_long))
----------------
shafik wrote:
> vsk wrote:
> > shafik wrote:
> > > While I get what you are doing here, we know he structure of libc++ SSO
> > > implementation and we are manually building a corrupt one, this is
> > > fragile to changes in the implementation.
> > >
> > > I don't have an immediate suggestion for an alternative approach but if
> > > we stick with this we should stick a big comment explaining this, perhaps
> > > laying out the assumptions of the internal layout we are assuming and
> > > maybe some sanity checks maybe using `offsetof` to verify fields exist
> > > and are where we expect them to be.
> > I don't see how this is fragile. The structure of libc++'s SSO
> > implementation is ABI, and is unlikely to change (esp. not in a way that
> > turns either one of the garbage strings into a valid string). I've left
> > comments explaining what's wrong with both of the garbage strings, but can
> > leave a pointer to https://joellaity.com/2020/01/31/string.html for more
> > info?
> Sure, that note would be fine.
Can you instead do a `#if _LIBCPP_ABI_VERSION == 1` and have the #else as an
#error that this test needs updating. We don't support any other libc++ ABI
beside 1 in LLDB but if we ever do then this should not silently pass.
================
Comment at: lldb/source/DataFormatters/StringPrinter.cpp:73
uint8_t *&next) {
+ assert(isInHalfOpenRange(buffer, buffer, buffer_end) &&
+ "Cannot read the first byte of ASCII string buffer");
----------------
Isn't this just `assert(buffer<buffer_end)`? That's less confusing IMHO (and I
think in general this check can be in `GetPrintable` as this should always be
true for all `GetPrintableImpl`).
================
Comment at: lldb/source/DataFormatters/StringPrinter.cpp:140
uint8_t *&next) {
+ assert(isInHalfOpenRange(buffer, buffer, buffer_end) &&
+ "Cannot read the first byte of UTF8 string buffer");
----------------
Same as above.
================
Comment at: lldb/source/DataFormatters/StringPrinter.cpp:149
+ if ((utf8_encoded_len == 0 || utf8_encoded_len > 4) ||
+ !isInHalfOpenRange(buffer + (utf8_encoded_len - 1), buffer, buffer_end))
return retval;
----------------
Isnt' `!isInHalfOpenRange(buffer + (utf8_encoded_len - 1), buffer,
buffer_end))` just `buffer + (utf8_encoded_len - 1U) < buffer_end`?
`utf8_encoded_len` is always positive so the check if it adding it to `buffer`
makes it smaller than `buffer` can only happen with an integer overflow IIUC
(which we probably should check against more explicitly then).
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D73860/new/
https://reviews.llvm.org/D73860
_______________________________________________
lldb-commits mailing list
[email protected]
https://lists.llvm.org/cgi-bin/mailman/listinfo/lldb-commits