28

I know that Javac compiler is able to transform String concatenation + using StringBuilder/StringBuffer, and I'm curious to know starting from which version this change was introduced?

I'm using this sample code:

public class Main {
  public static void main(String[] args) {
      String a = args[0];
      String s = "a";
      s = s + a;
      s = s + "b";
      s = s + "c";
      s = s + "d";
      s = s + "e";
      System.out.println(s);
  }
}

So far I've tried with javac 1.8.0_121, javac 1.6.0_20, javac 1.5.0_22 and java 1.4.2_19.

Here is a sample of the bytecode I see using javap -c from 1.4.2_19:

6:  astore_2
7:  new #3; //class StringBuffer
10: dup
11: invokespecial   #4; //Method java/lang/StringBuffer."<init>":()V
14: aload_2
15: invokevirtual   #5; //Method java/lang/StringBuffer.append:(Ljava/lang/String;)Ljava/lang/StringBuffer;
18: aload_1
19: invokevirtual   #5; //Method java/lang/StringBuffer.append:(Ljava/lang/String;)Ljava/lang/StringBuffer;
22: invokevirtual   #6; //Method java/lang/StringBuffer.toString:()Ljava/lang/String;

All 4 versions seems to be using the StringBuilder/StringBuffer optimization, so I'm curious to know starting from which Javac version this change was introduced?

4
  • Probably since the beginning... Commented Feb 27, 2017 at 14:18
  • Java 5 if I'm not mistaken Commented Feb 27, 2017 at 14:19
  • More useful would be to know if any of them can do this for a string built with a loop. Commented Feb 27, 2017 at 16:52
  • 1
    Note that it isn't always optimized automatically by the compiler. I brought a Linux Server with Java 8 to its knees last year, by using String#+ instead of StringBuilder. I wanted to build a reasonably large (about 300MB) gnuplot file. There was some logic involved, and I had to build two 150MB strings in parallel, and concatenate them at the end. With String#+, it took half an hour and all the memory available. With StringBuilder, it took just a few seconds and much less memory. Commented Feb 27, 2017 at 18:35

4 Answers 4

32

Here's a quote from the language specification from version 1:

An implementation may choose to perform conversion and concatenation in one step to avoid creating and then discarding an intermediate String object. To increase the performance of repeated string concatenation, a Java compiler may use the StringBuffer class (§20.13) or a similar technique to reduce the number of intermediate String objects that are created by evaluation of an expression.

Back at the time, they had StringBuffer instead of StringBuilder.

Also a quote from StringBuffer of JDK1.0.2:

This Class is a growable buffer for characters. It is mainly used to create Strings. The compiler uses it to implement the "+" operator.

Sign up to request clarification or add additional context in comments.

11 Comments

Nice find, but now I have to despise my former teachers for teaching me to use the uselessly verbose syntax of StringBuffer/Builder ;-( I'll try to find some solace believing that it might not have been optimized in the early version of the OpenJDK, please nobody ruin this :p
@Aaron StringBuilder should usually be used when constructing complex strings, e.g. in loops (see stackoverflow.com/questions/1532461/…).
Oh, thanks for the precision ! While it's not the point of the question, I think it might still be interesting to add in your answer, as OP and other readers might ignore that this optimisation isn't always possible and could conclude as I erroneously did that using string concatenation everywhere is fine.
@Aaron it's not needlessly verbose. Using StringBuilder directly gives you control over the initial buffer size. Estimating it could save you some expensive resizing of the underlying char array.
@Aaron I wouldn't call it premature optimization. If a loop does, for example, 10000 concatenations it's equally silly to use + as a StringBuilder with the default buffer of 16 characters. One case where + is actually faster is when concatenating static string values. In such cases the resulting string can be built at compile time.
|
11

I have looked up the Java Language Specification, First Edition (from 1996). Not an easy find, but here it is. The passage on concatenation optimization was there even then:

An implementation may choose to perform conversion and concatenation in one step to avoid creating and then discarding an intermediate String object. To increase the performance of repeated string concatenation, a Java compiler may use the StringBuffer class (§20.13) or a similar technique to reduce the number of intermediate String objects that are created by evaluation of an expression.

The specification pertained to StringBuffer then, but StringBuilder (which current JLS wording refers to) might be deemed better performing because its methods are not synchronized.

This, however, does not mean that one should rely on the optimization as always being in place. String concatenation in loops will not get optimized, for example.

Comments

5

JLS has already been given in some answers. I just want to make the point that StringBuffer (https://docs.oracle.com/javase/8/docs/api/java/lang/StringBuffer.html) was there since 1.0 whereas

StringBuilder(https://docs.oracle.com/javase/8/docs/api/java/lang/StringBuilder.html) came in version 1.5. Please see the since: section of respective javadocs.

1 Comment

No need to dive into the JLS or bytecode...just check the Javadoc for StringBuilder! It says Since: 1.5. So this answer is the best answer. (+1)
5

This does not answer the question, but I want to merely add to the overall point that in jdk-9 this StringBuilder::append is one of the permitted strategies, but not the default one.

private enum Strategy {
   /**
    * Bytecode generator, calling into {@link java.lang.StringBuilder}.
    */
    BC_SB,

    /**
     * Bytecode generator, calling into {@link java.lang.StringBuilder};
     * but trying to estimate the required storage.
     */
    BC_SB_SIZED,

    /**
     * Bytecode generator, calling into {@link java.lang.StringBuilder};
     * but computing the required storage exactly.
     */
     BC_SB_SIZED_EXACT,

   /**
    * MethodHandle-based generator, that in the end calls into {@link java.lang.StringBuilder}.
    * This strategy also tries to estimate the required storage.
    */
    MH_SB_SIZED,

    /**
     * MethodHandle-based generator, that in the end calls into {@link java.lang.StringBuilder}.
     * This strategy also estimate the required storage exactly.
     */
    MH_SB_SIZED_EXACT,

    /**
     * MethodHandle-based generator, that constructs its own byte[] array from
     * the arguments. It computes the required storage exactly.
     */
     MH_INLINE_SIZED_EXACT
}

It's actually an invokedynamic bytecode for String concatenation, so it's implementation is now JRE specific, not compiler one. The default strategy btw is : MH_INLINE_SIZED_EXACT

2 Comments

Sorry but could you explain what permitted strategies means? Is it some runtime optimization dedicated to string concatenation?
@glee8e I think I've covered the topic a bit here: stackoverflow.com/questions/40267601/…

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.