CodeRunner: Using more than one JDK / Java prototype

Hi Richard,

To avoid breaking lots of old questions that have (support file jar) incompatibilities with a new JDK, we installed JDK 21 alongside the old JDK and wanted to add a new question prototype for JDK 21.

Here's what the developer said:

"I am trying to use two versions of java in coderunner using the same jobe server. On the jobe server I have

Supported languages:
    c: 11.5.0
    cpp: 11.5.0
    java21: 21.0.7
    java: 1.8.0_452
    nodejs: 22.16.0
    php: 8.0.30
    python3: 3.11.11

I created this by copying /application/libraries/java_task.php to /application/libraries/java21_task.php and changing /etc/alternatives/java to /etc/alternatives/java21

In moodle I copied the BUILT_IN_PROTOTYPE_java_class to create a java21 version and built a question based on that. I am getting the below error where is seems coderunner is using the language name ‘java21’ for the file extension.

Syntax Error(s)

error: Class names, '__tester__.java21', are only accepted if annotation processing is explicitly requested

1 error

Am I using the wrong approach or can someone help with this approach to allow multiple java version questions on the same system so we can create new questions and maintain the old.

We don't seem to be linking correctly to the new JDK. If we make the question prototype (or old question) sandbox language java21 we get the same error.

Is what we're trying to do possible and would you have any insight as to what we're doing wrong?

Thank you,
Anton

Re: Using more than one JDK / Java prototype

de Richard Lobb - jueves, 26 de junio de 2025, 13:48

You've fallen over a "feature" of Jobe that language tasks always generate a file of the form "__tester__.<language>". Because you're trying to define a new language java21 you've landed up with a tester with the extension .java21, which Java objects to. [AFAIK, Java is the only language that incorporates filenames into the language definition - grrrrr].

I really don't recommend trying to introduce new languages into Jobe. It's much easier to script them directly in CodeRunner. You can have a question prototype that uses a Python script to run anything you like on Jobe, which gives you much more flexibility with things like compile and runtime options. And it's way more maintainable. You can still set the Ace editor to use Java syntax colouring even though the Jobe task is (initially at least) Python.

I attach the XML export of a question PROTOTYPE_java21 together with a question that uses that prototype. It assumes you want a Java Program question type but it's easy to change it to Java Class or Java Method. The prototype assumes that the Java21 compiler is javac21 and the runtime is java21 - you can edit those two commands in the prototype template if necessary.

The template for the prototype is:

import subprocess
import re
import sys
student_answer = """{{ STUDENT_ANSWER | e('py') }}"""

# Need to determine public class name in order to name output file. Sigh.
# The best I can be bothered to do is to use a regular expression match.
match = re.search(r'public\s+class\s+([_a-zA-Z][_a-zA-Z0-9]*)', student_answer, re.DOTALL | re.MULTILINE)
if match is None:
    raise Exception("Unable to determine class name. Does the file include 'public class name'?")
classname = match.group(1)
filename = classname + '.java'

# Write the file
with open(filename, "w") as src:
    print(student_answer, file=src)
    
# Compile and run 
return_code = subprocess.call(['javac21', "-J-Xss64m", "-J-Xmx4g", filename])
if return_code != 0:
    print("** Compilation failed. Testing aborted **", file=sys.stderr)
else:    
    # If compile succeeded, run it. (You may want to tune the Java parameters here.)
    exec_command = ["java21", "-Xss16m", "-Xmx500m", classname]
    try:
        result = subprocess.run(exec_command, text=True, check=True)
    except subprocess.CalledProcessError as e:
        if e.returncode > 0:
            print(f"Task failed with a return code of {e.returncode}", file=sys.stderr)
        else:
            # Negative return codes are signals.
            print("Task failed with signal", -e.returncode, file=sys.stderr)
        print("** Further testing aborted **", file=sys.stderr)

Caveats:

This has had minimal testing, so may need some tweaking. But the style of question - using Python to run other languages - is very well tested.
The question type has set the runguard sandbox to not impose any memory limits, since the Java VM should manage those.
- You can adjust the memory limits in the Java run command.

Post back if you have any issues or need further help.

PROTOTYPE_java21.xml

Test Java21.xml

Re: Using more than one JDK / Java prototype

de Anton Dil - viernes, 27 de junio de 2025, 21:24

Hi Richard,

Thank you for the speedy reply and useful example. I seem to have a per-test template working, although I am unsure about the best values for the various compiler and VM flags. I'm thinking of ~1000 students trying to run tests in an exam at once. So far, so good :-)

Regards,
Anton

Re: Using more than one JDK / Java prototype

de Richard Lobb - domingo, 29 de junio de 2025, 19:21

Hi Anton

1000 students running Java at the same time is a pretty frightning scenario! We run up to 1000 student in Python with a 16-core Moodle server, an 8-core psql DB server and a pool of several Jobe servers (the number of these varies). We use a staggered start to ease the load on the db server. But Java is much more expensive on the Jobe servers. Jobe on my laptop can maintain a steady run rate of 35 - 40 minimal-sized C or Python questions per second, but only 7 Java questions. That implies you'd need at least 5 times as many Jobe servers. I understand OU now uses Amazon elastic compute cloud (is that right?) - I sure hope it can stretch to hande the Jobe server load!

BTW, I don't have useful advice on the Java VM runtime parameters. I don't teach Java and the Java users in our department make only relatively light use of CodeRunner. But FWIW the default parameters used by "native" Jobe Java runs are "-Xrs -Xss8m -Xmx200m". The parameters in the template above double both the stack and heap memory allocations. But you may want to consider adding the -Xrs. Someone (presumably one of my collaborators) has added the -Xrs parameter to jobe with the comment "reduces usage signals by Java, because that generates extra debug output when program is terminated on timelimit exceeded". Hopefully, though, time limit exceeded is a relatively rare event so I doubt it will greatly change the average throughput.

One final comment. If you're running Java Class or Java Method questions, you'd probably want to switch to using a combinator template. Otherwise, each test case will require a separate jobe run, which means a separate compile. Java compiles are what chew up most of the CPU time. You can even use a combinator template with write-a-program questions but you need to set the allow-mutliple-stdins checkbox, and do your own management of stdin for each run. By default, CodeRunner falls back to one-jobe-task-per-test mode if there are multiple stdins.

Richard

Re: Using more than one JDK / Java prototype

de Richard Lobb - lunes, 30 de junio de 2025, 09:49

A further thought. A collaborator added the following source code to Config/jobe.php on the Jobe server. Given you're planning on running a huge exam, you might want to consider adding these extra options to the template? Your call, of course - I have no experience with large Java classes in CodeRunner.

/*
|--------------------------------------------------------------------------
| Extra Java/Javac arguments [Thanks Marcus Klang
|--------------------------------------------------------------------------
|
| This section of the config file adds extra flags to java and javac
|
| Provided examples tells java/javac that there is only 1 core, which
| reduces the number of spawned threads when compiling. This option can be
| used to provide a better experience when many users are using jobe.
*/
public string $javac_extraflags = ''; //'-J-XX:ActiveProcessorCount=1';
public string $java_extraflags = ''; //'-XX:ActiveProcessorCount=1';

Re: Using more than one JDK / Java prototype

de Anton Dil - lunes, 30 de junio de 2025, 22:44

Thanks again Richard, more to ponder on! I am exploring a combinator template now to deal with the multiple compilation issue.

@Tim I would like them to have Check buttons, even if only to compile, or say pass/fail on the tests without showing them which ones passed or failed.

We still have occasional Jobe server errors now without any check buttons being shown, but had many the first time a check button was made available.

Re: Using more than one JDK / Java prototype

de Tim Hunt - lunes, 30 de junio de 2025, 21:01

Actually, we have run this exam for a few years now. Yes, it puts a big load on the server(s), but not more than they can handle - helped by the fact that this is an old-school exam moved online, so no check or pre-check, and there are only 3 CodeRunner questions.

There are two VMs like this. 8 cores, 40GB RAM. (We did increase that after the first year fo this exam.)

Re: Using more than one JDK / Java prototype

de Anton Dil - martes, 1 de julio de 2025, 02:28

I've been experimenting with the combinator and have it working up to a point, that is, when the answer is okay.

I'd like it to report just once if students' code doesn't compile, but I can't get this aspect working. I keep getting something similar to "Error in question Expected 3 test results, got 1. Perhaps excessive output or error in question?" when the answer doesn't compile.

I have been trying to compile the student's code separtely from the __tester__ class before compiling both together, but it doesn't seem to get that far in the template anyway.

Here's the template I am using (with some debugging output in it). When the student code doesn't compile I don't see that debugging output.

I could also compile the answer box code as a string, but I wasn't hoping to go that far, yet. Anyway t feels like something else is going on before the template is run.

import re
import subprocess
import sys

# 1. Get student answer and strip 'public' from class/interface/record
student_answer = """{{ STUDENT_ANSWER | e('py') }}"""

student_answer = re.sub(
r'\bpublic\s+(abstract\s+class|class|interface|record)\b',
r'\1',
student_answer
)

# 2. Extract classname from student code
match = re.search(
r'\b(?:abstract\s+)?(class|interface|record)\s+([_a-zA-Z][_a-zA-Z0-9]*)\b',
student_answer
)

if not match:
print("The answer didn't contain a class, interface or record.")
sys.exit(1)

classname = match.group(2)

# 3. Write student's Java code to file
with open(f"{classname}.java", "w") as f:
f.write(student_answer)

# 5. Compile student + tester files
cargs = ["javac21",
"-J-Xss512k",
"-J-Xmx64m",
"-J-XX:CompressedClassSpaceSize=32m"]

compile_cmd1 = cargs + [
f"{classname}.java"
]

comp = subprocess.run(compile_cmd1, capture_output=True, text=True)

if comp.returncode != 0: # first try compile by itself without template
print(comp.stderr)
sys.exit(2)
else:
print("Student code compiled")

# 4. Only now write __tester__.java using Twig loop syntax
with open("__tester__.java", "w") as f:

f.write("""
public class __tester__ {

private static final String SEPARATOR = "##";

public static void main(String[] args) {
__tester__ main = new __tester__();
main.runTests();
}

public void runTests() {
try {
{% for TEST in TESTCASES %}
{
{{ TEST.testcode }}
{% if not loop.last %}
System.out.print("##");
{% endif %}
}
{% endfor %}
}
catch (Throwable e) {
System.out.println("Exception: " + e);
}
}
}
""")

compile_cmd2 = cargs + [
f"{classname}.java",
"__tester__.java"
]

comp = subprocess.run(compile_cmd2, capture_output=True, text=True)

if comp.returncode != 0:
print(comp.stderr)
sys.exit(2)

# 6. Run compiled Java program
run_cmd = [
"java21",
"-Xms1m",
"-Xmx64m",
"-Xss512k",
"-XX:CompressedClassSpaceSize=32m",
"__tester__"
]

run = subprocess.run(run_cmd, capture_output=True, text=True)

print(run.stdout)
if run.returncode != 0:
print(run.stderr, file=sys.stderr)
sys.exit(3)

Re: Using more than one JDK / Java prototype

de Richard Lobb - martes, 1 de julio de 2025, 11:36

Thanks Tim for the interesting info on your set up. Having only 3 questions is a big win - we have 30 in our typical Python exams. But not having any sort of Check or Precheck functionality rather horrifies me.

Anton: Well done on biting the bullet and writing a combinator Java Class question type! Something I never got around to, as I don't teach Java.

Well done - you're so close! And you have my sympathy/apology for missing just one subtlety that may or may not be buried somewhere in the documentation but is highly non-obvious. Namely, that if you wish to abort processing of testcases in a combinator, you need to generate output to the stderr stream. You've printed your error messages to stdout, which is taken to mean that the test "worked" in the sense of not being wrong enough to cause an abort.

Here's my updated version of your template. Apart from changing your code to output all error messages to stderr, the only other change is removing your debugging message.

import re
import subprocess
import sys

# 1. Get student answer and strip 'public' from class/interface/record
student_answer = """{{ STUDENT_ANSWER | e('py') }}"""

student_answer = re.sub(
r'\bpublic\s+(abstract\s+class|class|interface|record)\b',
r'\1',
student_answer
)

# 2. Extract classname from student code
match = re.search(
r'\b(?:abstract\s+)?(class|interface|record)\s+([_a-zA-Z][_a-zA-Z0-9]*)\b',
student_answer
)

if not match:
    print("The answer didn't contain a class, interface or record.", file=sys.stderr)
    sys.exit(1)

classname = match.group(2)

# 3. Write student's Java code to file
with open(f"{classname}.java", "w") as f:
    f.write(student_answer)

# 5. Compile student + tester files
cargs = ["javac21",
"-J-Xss512k",
"-J-Xmx64m",
"-J-XX:CompressedClassSpaceSize=32m"]

compile_cmd1 = cargs + [
f"{classname}.java"
]

comp = subprocess.run(compile_cmd1, capture_output=True, text=True)

if comp.returncode != 0: # first try compile by itself without template
    print(comp.stderr, file=sys.stderr)
    sys.exit(2)

# 4. Only now write __tester__.java using Twig loop syntax
with open("__tester__.java", "w") as f:

    f.write("""
public class __tester__ {

  private static final String SEPARATOR = "##";

  public static void main(String[] args) {
    __tester__ main = new __tester__();
    main.runTests();
  }

  public void runTests() {
    try {
{% for TEST in TESTCASES %}
      {
        {{ TEST.testcode }}
{% if not loop.last %}
        System.out.println(SEPARATOR);
{% endif %}
      }
{% endfor %}
    }
    catch (Throwable e) {
      System.err.println("Exception: " + e);
    }
  }
}
""")

compile_cmd2 = cargs + [
f"{classname}.java",
"__tester__.java"
]

comp = subprocess.run(compile_cmd2, capture_output=True, text=True)

if comp.returncode != 0:
    print(comp.stderr, file=sys.stderr)
    sys.exit(2)

# 6. Run compiled Java program
run_cmd = [
    "java21",
    "-Xms1m",
    "-Xmx64m",
    "-Xss512k",
    "-XX:CompressedClassSpaceSize=32m",
    "__tester__"
]

run = subprocess.run(run_cmd, capture_output=True, text=True)

print(run.stdout)
if run.returncode != 0:
    print(run.stderr, file=sys.stderr)
    sys.exit(3)

I also attach an xml export of a silly test of this template.

When you've got it all working and tested to your satisfaction, please post the xml export of a prototype that I can include in the new unsupported question types folder. Or ... perhaps I should just throw away the existing Java Class question type and use this one instead? Let me know your thoughts.

I note that your new question type does two compiles rather than one, which is still a big win if you have greater than two tests. I assume you compile the student code separately to avoid confusing error messages being apparently in the test code, e.g. triggered by the student providing the wrong class name? You might be able to get away with a single compile by including a few syntax checks (in Python) on the supplied code? I'm not sure - it's a long time since I wrote much Java.

A final comment. If you want the ultimate in question types, you can use a combinator template grader, which gives you complete control of the output, though at a significant increase in complexity. For example, you can output compile error messages at the start, without any result table, rather than having them land up in the first test case. This is another area of CodeRunner that is lacking in public examples, unfortunately.

questions-TestAndDevel-Testing java class question using python.xml

Re: Using more than one JDK / Java prototype

de Richard Lobb - martes, 1 de julio de 2025, 12:28

Another thought: you probably don't need to capture the output from the subprocess runs, then print it. I think you should be able to just remove the output-capturing and let the subprocess print it directly?

However, if you ever want to switch to a combinator template grader, you not only need to capture all the output, you need to package it all up into the JSON result that should be the only output from the run.

Re: Using more than one JDK / Java prototype

de Anton Dil - miércoles, 2 de julio de 2025, 22:46

Hi Richard,

Thank you for the fix!

Good point, there is probably no point in capturing the run output. For the compilation I had thoughts of limiting the number of errors shown, as a single typo can trigger multiple compiler messsages. However, displaying only the first error isn't necessarily the best approach either.

As you suspected I wanted the first compilation message to give the correct filename and line number for the student's own code. For the second compilation we only really need __tester__.java in there, no need to recompile the student's code or other support files, which will be triggered by the compiler anyway.

One thing I'd prefer not to have is the ***Run error*** message, which if it appears above a compilation error may lead students to think their code was run. (Students often have a weak grasp of when errors are occurring.) But it doesn't look like that can be captured this way.

I'm still experimenting with support files but it's nice to see I can get checkstyle running in precheck this way.

My first thought about using this as a default class template was that the existing one is relatively easy for a Java coder to understand, but that only matters if you want to customise anyway, so maybe it is worth it.

Re: Using more than one JDK / Java prototype

de Anton Dil - miércoles, 9 de julio de 2025, 01:02

Hi Richard,

Here's my prototype - this seems to be working well although obviously it's not been tested by students. Some of the limits are set quite low but seem okay from what i've read. We don't tend to do anything resource heavy in these quizzes, no large files to load, or large data structures.

I've incorporated flags you mentioned after researching further.

Thanks for your help!

questions-M250-WS-PROTO_Java21_Class-20250708-1356.xml

Re: Using more than one JDK / Java prototype

de Richard Lobb - miércoles, 9 de julio de 2025, 10:59

Thanks Anton. I've already made use of the list of flags you've come up with in another forum posting :-)

Re: Using more than one JDK / Java prototype

de Anton Dil - viernes, 11 de julio de 2025, 02:20

Hi Richard, Looking at this prototype again I see that I have 0 and then in the test question I ticked 'is combinator'. This seemed to work, but surprises me now. Just another wrinkle for me, I seem to find a lot of them!

Re: Using more than one JDK / Java prototype

de Richard Lobb - viernes, 11 de julio de 2025, 10:14

Hi Anton. It sounds like you must have clicked Customise in the test question as well - otherwise you wouldn't be able to see the 'is combinator' checkbox. If you customise a question it copies the fields across from the prototype, allowing you to alter them. However, you're then disconnected from the prototype if you subsequently update it. That rather defeats the point of question prototypes, so I try not to customise individual questions except in rare cases.

The reason it's currently working is that you're using the template copied from the prototype (which is a combinator template) and have set the test question to be a combinator too. If you uncheck customise - which I recommend - your test question will break until you fix the prototype, making it a combinator.

As an aside, the Customise checkbox is a little quirky. If you check it, the UI switches to customise mode, allowing you to update any or all of the fields copied from the prototype. But the Customise state is not saved with the question; when you load a question, all fields are compared with the prototype's and if any are different, the customise checkbox is set.

Re: Using more than one JDK / Java prototype

de Anton Dil - viernes, 11 de julio de 2025, 20:42

Thanks Richard, that makes sense to fix the prototype and untick customise in the question.

Re: Using more than one JDK / Java prototype

de Richard Lobb - lunes, 21 de julio de 2025, 15:48

Hi Anton. Looking back, I see I never responded to your comment about the "***Run error***" message and how to get rid of it.

Unfortunately, the only way to do so is to switch to a combinator template grader instead of just the "simple" combinator you have. With a combinator template grader you take complete control of the grading process and the feedback to the student. We use combinator template graders for most of our in-house question types in order to give ourselves that complete control, but they are a big step up in complexity. You have to run the student code in a subprocess, capturing all the output and grading it yourself. You can return a custom compilation error message if compilation fails. Otherwise you can return a result table in your own preferred format (with your own selected column, column formats, embedded images etc) preceded or followed by your own HTML. But you have to make sure that the student code *never* escapes from your control, so you need to set your own timeout which must be less than the Jobe timeout and catch any exceptions.

Definitely not for the faint hearted. But I can provide more support if you're keen enough.

Re: Using more than one JDK / Java prototype

de Anton Dil - lunes, 21 de julio de 2025, 19:58

Hi Richard, I suspected that was the answer. I think I can live with it for now. :-) I might have a go at some point.

If we are using this in an exam, for now I'll either not show the feedback (I assume that would work) or let students know to expect it.

Re: Using more than one JDK / Java prototype

de Anton Dil - martes, 30 de septiembre de 2025, 23:42

Hi Richard, A postscript on this. I noticed a while ago that my template prints the separator string without a newline character on the end, in java using System.out.print rather than println. I probably arrived at that after some trial and error. I hope it's the right choice.

I also see in the documentation that "trailing whitespace" is stripped before a comparison is made with student output, in an exact comparison. However, I think that \t and \r\n characters aren't handled this way, which are both whitespace in java. Maybe the documentation should refer to stripping LF characters? If you have a method that returns say "x" + "\r\n" and print that result, the output becomes x\r rather than x followed by a newline, which is also odd.

Re: Using more than one JDK / Java prototype

de Richard Lobb - jueves, 2 de octubre de 2025, 10:13

Hi Anton. Two issues here.

1. Printing the separator. Combinator questions have a defined separator which is used to split the output. In the authoring form for the default python3 question type, the separator (a PERL regular expression) is defined as
|#<ab@17943918#@>#\n|ms
This includes the newline character, whereas the definition of SEPARATOR in the template does not. Hence you need something like "print(SEPARATOR)" which adds the newline.

However, when I look at the last prototype you posted (questions-M250-WS-PROTO_Java21_Class-20250708-1356.xml) I'm puzzled. It doesn't have the "Combinator" checkbox checked, so isn't a combinator template. Yet the template processes the twig variable TESTCASES as a list of tests. But that variable should be undefined for a non-combinator template! So I don't see how it can work. Would you mind posting xml exports of both your current prototype and a working question, please, so I can figure out what's going on?

2. Stripping white space. Many thanks for raising this issue. For many years now virtually all our CodeRunner questions have used combinator template graders, and furthermore we don't see return or tab characters in the student output. So I was unaware of the issue.

Looking at the source code, I remember that when I first had to clean the output from the student program I had performance issues. PHP5 had horrendously inefficient string handling, both in CPU time and memory use. So I resorted to a roll-your-own character-by-character filtering process that - as you observe - only stripped trailing space characters not all whitespace.

However, PHP7 improved string handling CPU usage by a factor of 10 and reduced memory overhead significantly too. So I've rewritten the clean() function using built-in PHP functions and it's now much clearer. It uses rtrim on each line which behaves as specified - removing all trailing white space. It also turns out to outperform the old version by a factor of 3 - 4 in speed when tested on outputs of over one hundred thousand lines.

My only slight concern is: how many teachers will have programmed around the deficiency of the old questions by adding explicit white space characters like '\r' at the end of lines in their Expected Outputs? Their questions are going to break with the new implementation.

Re: Using more than one JDK / Java prototype

de Anton Dil - jueves, 2 de octubre de 2025, 21:50

Hi Richard,

I don't have quick access to the current xml of the prototype. It's now on a live server I don't have permissions for. But it has been updated to set iscombinator, so TESTCASES is available :-)

We now have System.out.println(SEPARATOR) and the regex you suggest above is also set. I can see by ticking/unticking customise that those features are there.

2. Glad it's not me then and it's good to hear that you have a nice fix for that! I was experimenting to try to understand how it works and I've not seen this issue in practice, but then with enough students all kinds of things happen.

3! I found that if I enable attachments and ask students to upload a file with the same name as I save in the template code (i.e. f"{classname}.java"), that doesn't work.

***Run error***
Traceback (most recent call last):
File "__tester__.python3", line 34, in
with open(f"{classname}.java", "w") as f:
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
PermissionError: [Errno 13] Permission denied: 'Test.java'

So in my attached version I have tried using a different name, which works using the same trick as in your templates of stripping out the 'public' modifier from compilation units, so that javac won't complain about compiling a filename that differs from the class name. I haven't tested it much, but it seems to work.

Thanks!

Java21ClassTemplateCode2-10-25.txt

Re: Using more than one JDK / Java prototype

de Anton Dil - viernes, 3 de octubre de 2025, 02:55

Later: I now find it doesn't seem to be about attachments as such - I have the same issue on another question without attachments, where I have a support file and the same line is giving the same error relating to that support file, as long as 'validate answer' is ticked. Making the same change to the template also fixes that error. I think what led to this was refreshing the support file (dropping in the latest version).