Add limits when deserializing BigDecimal and BigInteger #2510
eamonnmcmanus merged 5 commits into google:main

Conversation
eamonnmcmanus left a comment
Thanks for taking this on!
long value = 0; // Negative to accommodate Long.MIN_VALUE more easily.
boolean negative = false;
boolean fitsInLong = true;
int exponentDigitsCount = 0;
What would you think of instead recording the index of the first exponent digit? Then you could still easily reject a number if it has too many exponent digits. Or you could parse the digits and compare against a limit, which would fix the leading-zero problem.
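A minimal sketch of the "parse the digits and compare against a limit" suggestion (the class, method name, and limit value here are illustrative, not Gson's actual code). Accumulating the exponent value instead of counting digits makes leading zeros a non-issue:

```java
public class ExponentCheck {
  // Illustrative maximum exponent; similar in spirit to the digit-count
  // limit discussed above, but this value is an assumption for the sketch.
  private static final int MAX_EXPONENT = 9999;

  static void checkExponent(String exponentDigits) {
    int exponent = 0;
    for (int i = 0; i < exponentDigits.length(); i++) {
      exponent = exponent * 10 + (exponentDigits.charAt(i) - '0');
      // Reject as soon as the limit is exceeded, so a very long digit run
      // cannot overflow the int accumulator
      if (exponent > MAX_EXPONENT) {
        throw new IllegalArgumentException("Number exponent too large");
      }
    }
  }

  public static void main(String[] args) {
    checkExponent("000000324"); // leading zeros are harmless: value is 324
    try {
      checkExponent("2132332"); // rejected: exceeds MAX_EXPONENT
    } catch (IllegalArgumentException expected) {
      System.out.println("rejected: " + expected.getMessage());
    }
  }
}
```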
What would you think of instead recording the index of the first exponent digit?
I guess that would be possible, but it would then rely on the buffer not being refilled (and indices not changing) during parsing, which happens to be the case currently. From that perspective, tracking the count seems a bit more reliable to me.
Is your concern performance (I assume not)? Or that tracking the index might lead to cleaner or easier-to-understand code?
Or you could parse the digits and compare against a limit, which would fix the leading-zero problem.
I mainly omitted the leading 0s check for simplicity (but nonetheless added a comment and test to document the behavior). Fixing it might also be possible by slightly adjusting this method to explicitly check for 0. For example:
...
...
} else if (last == NUMBER_CHAR_EXP_E || last == NUMBER_CHAR_EXP_SIGN) {
  last = NUMBER_CHAR_EXP_DIGIT;
+ if (c != '0') {
    exponentDigitsCount++;
+ }
} else if (last == NUMBER_CHAR_EXP_DIGIT) {
+ if (exponentDigitsCount > 0 || c != '0') {
    exponentDigitsCount++;
    // Similar to the scale limit enforced by NumberLimits.parseBigDecimal(String)
    // This also considers leading 0s (e.g. '1e000001'), but probably not worth the effort ignoring them
    if (exponentDigitsCount > 4) {
      throw new MalformedJsonException("Too many number exponent digits" + locationString());
    }
+ }
}

I just wasn't sure whether that many leading 0s is really a common use case and worth supporting.
Should I solve this with the diff shown above (and some comments)?
The parsing logic is fairly complicated, but it seems to me that the way the i variable works is quite simple, starting from 0 and increasing. Even if the buffer is refilled and pos changes, i is still a valid offset. So I think saving its value and using it at the end is reasonably robust. And I feel the resulting code would be slightly simpler. It also seems very slightly better to check for a specific maximum exponent rather than a number of exponent digits.
I still have the question of whether we need to change JsonReader at all, if we're also checking for large exponents in TypeAdapters.
It also seems very slightly better to check for a specific maximum exponent rather than a number of exponent digits.
Ok, I will give it a try.
I still have the question of whether we need to change JsonReader at all, if we're also checking for large exponents in TypeAdapters.
Changing JsonReader is mainly for the cases where users directly obtain the number from the JsonReader using nextString() and then parse the number themselves. Do you think this should not be done? In that case is it ok if I change the documentation of nextString() to tell the users to validate the number string, if necessary?
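For illustration, user-side validation of a number string obtained via nextString() might look like this sketch. The helper name, the exponent limit, and the simplistic exponent extraction are assumptions, not Gson API:

```java
import java.math.BigDecimal;

public class NumberStringValidation {
  // Sketch of validating a raw number string (as returned by
  // JsonReader.nextString()) before parsing it; the 9999 limit is an
  // arbitrary example in the spirit of the limits discussed in this PR.
  static BigDecimal parseWithLimits(String raw) {
    int eIndex = Math.max(raw.indexOf('e'), raw.indexOf('E'));
    if (eIndex != -1) {
      // Reject absurd exponents before constructing the BigDecimal
      long exponent = Long.parseLong(raw.substring(eIndex + 1));
      if (Math.abs(exponent) > 9999) {
        throw new NumberFormatException("Exponent out of range: " + exponent);
      }
    }
    return new BigDecimal(raw);
  }

  public static void main(String[] args) {
    System.out.println(parseWithLimits("-122.08e-2132")); // accepted
    try {
      parseWithLimits("-122.08e-2132332"); // rejected
    } catch (NumberFormatException expected) {
      System.out.println("rejected: " + expected.getMessage());
    }
  }
}
```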
Obsolete, I have reverted the changes to JsonReader, see #2510 (comment)
gson/src/test/java/com/google/gson/functional/PrimitiveTest.java
@Test
public void testDeserializingBigDecimalAsFloat() {
  String json = "-122.08e-2132332";
This is unfortunate. Do we need the logic in JsonReader or would it be enough to just have the new logic in TypeAdapters at the point where the number is converted to a BigDecimal or BigInteger?
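A rough sketch of what such a check at the conversion point could look like (the method name and threshold here are assumptions; the discussion only references NumberLimits.parseBigDecimal by name, and Gson's actual implementation may differ):

```java
import java.math.BigDecimal;

public class AdapterLimitSketch {
  // Sketch of a limit applied where a JSON number string is converted to
  // BigDecimal inside a type adapter, rejecting numbers whose scale
  // (driven by the exponent) exceeds a chosen bound.
  static BigDecimal toLimitedBigDecimal(String numberString) {
    BigDecimal decimal = new BigDecimal(numberString);
    if (Math.abs(decimal.scale()) > 10_000) {
      throw new NumberFormatException("Number has unsupported scale: " + decimal.scale());
    }
    return decimal;
  }

  public static void main(String[] args) {
    System.out.println(toLimitedBigDecimal("1.5")); // accepted
    try {
      toLimitedBigDecimal("-122.08e-2132332"); // rejected: scale far beyond the bound
    } catch (NumberFormatException expected) {
      System.out.println("rejected: " + expected.getMessage());
    }
  }
}
```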
I mainly removed these tests because a few lines above something similar is covered already:
gson/src/test/java/com/google/gson/functional/PrimitiveTest.java, lines 1033 to 1036 in c98efd8
To me, this ...e-2132332 also does not look like any specific number, just an arbitrary value with a large number of exponent digits.
For comparison, Double.MIN_VALUE is 4.9e-324.
I assume it would be possible to adjust the logic in JsonReader and defer the check until nextString() is called, but that might make it a bit more complicated, possibly having to check the string there, or adding a new field indicating whether the parsed number exceeded the limits.
And because at least these numbers here are so contrived I am not sure if that is worth it.
Oh, I misunderstood. I thought these tests were removed because these values were now rejected. If they still produce the same results but are redundant then it's fine to remove them.
Sorry, you did understand it correctly; they are rejected now.
But I removed them instead of adjusting them (e.g. to use ...e-9999) because in my opinion they did not add much value compared to the other tests: 122.08e-2132 is already parsed as 0.0, so testing -122.08e-2132332 seemed a bit pointless to me.
Do you think this should be supported though? I am not sure if these are really realistic values, or just dummy values for testing.
I have now removed the limit checks in JsonReader, because they would be ineffective when nextString() is called but the JSON data contains a JSON string (for which no restrictions exist) instead of a JSON number, and the user did not explicitly use peek() to verify that the value is a JSON number.
However, the parsing logic for Double.parseDouble (which is called by JsonReader.nextDouble) and Float.parseFloat is quite complex, see https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/jdk/internal/math/FloatingDecimal.java
But if it had performance problems for certain number strings, then it would affect all other parts of Gson which parse as double or float as well.
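For illustration, Double.parseDouble does not fail on such contrived exponents; it simply underflows to (signed) zero:

```java
public class DoubleUnderflow {
  public static void main(String[] args) {
    // A huge negative exponent underflows to signed zero instead of throwing
    double d = Double.parseDouble("-122.08e-2132332");
    System.out.println(d);        // -0.0
    System.out.println(d == 0.0); // true (negative zero compares equal to 0.0)
  }
}
```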
OK, so if I understand correctly, these tests would now pass, but they are redundant given testValueVeryCloseToZeroIsZero just above. The name of the method is testDeserializingBigDecimalAsFloat but actually it has nothing to do with BigDecimal. Is that right?
Yes, that is right.
The name of the method is testDeserializingBigDecimalAsFloat but actually it has nothing to do with BigDecimal.
No, it doesn't look like it. These tests lead to Gson.doubleAdapter or Gson.floatAdapter being used, which call JsonReader.nextDouble. Neither BigDecimal nor its type adapter seems to be involved.
@eamonnmcmanus, are the changes like this ok?
Thanks again!
Purpose
Adds limits when deserializing BigDecimal and BigInteger

Checklist
- New public API validates arguments (for example rejects null)
- Javadoc uses @since $next-version$ ($next-version$ is a special placeholder which is automatically replaced during release)
- No JUnit 3 features are used (such as extending class TestCase)
- mvn clean verify javadoc:jar passes without errors