How to generate multi-line completions (code generation) with OpenAI?

I have finally received access to a trial account on OpenAI and took it for a spin. My interest is in code generation, so I first tried something simple using the playground:

Q: generate a standard model for Foo
A: public class Foo
     {
         public int Id { get; set; }
         public DateTime DateCreation { get; set; }
         public DateTime? DateModification { get; set; }
         public int User { get; set; }
         public bool Archive { get; set; }
}	
Q: create a standard model for Bar
A: public class Bar
     {
        public int Id { get; set; }
        public DateTime DateCreation { get; set; }
		public DateTime? DateModification { get; set; }
		public int User { get; set; }
		public bool Archive { get; set; }
	}

Q: create a standard model for MyNewModel
A: public class MyNewModel
     {
       public int Id { get; set; }
	public DateTime DateCreation { get; set; }
	public DateTime? DateModification { get; set; }
	public int User { get; set; }
	public bool Archive { get; set; }
}

(the last answer is automatically generated)

Playground parameters are as follows:

Engine = curie (second best after davinci, good results, faster and cheaper)
Temperature = 0 (always the same output for the same input)
Stop sequences = "A:" and "Q:"
Inject start text = "A:"
Inject restart text = "Q:"

This works as expected and seems to rely on create completion via GET.
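As a side note on the stop sequences listed above: as far as I understand, generation ends at the first stop sequence and the returned text does not include it. The following is only a local sketch of that behaviour, not a call to OpenAI; `TruncateAtStop` and the sample text are hypothetical names and data made up for illustration.

```csharp
using System;

// Hypothetical helper (not part of any OpenAI library): cut a generated
// text at the earliest occurrence of any of the given stop sequences.
static string TruncateAtStop(string generated, params string[] stops)
{
    int cut = generated.Length;
    foreach (var stop in stops)
    {
        int index = generated.IndexOf(stop, StringComparison.Ordinal);
        if (index >= 0 && index < cut)
        {
            cut = index;
        }
    }
    return generated.Substring(0, cut);
}

// Made-up raw text: pretend the model kept going and started a new question.
string raw = " public class MyNewModel\n{\n}\nQ: create a standard model for Baz";
string completion = TruncateAtStop(raw, "A:", "Q:");

// Only the class remains; the trailing "Q: ..." part is dropped.
Console.WriteLine(completion);
```

With "A:" and "Q:" as stop sequences, the answer ends as soon as the model starts writing the next question, which is why only one class comes back in the playground.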

I switched to code and found OpenAI-API-dotnet which uses the recommended create completion POST API method. My entire code is as follows:

			var api = new OpenAIAPI(engine: Engine.Curie);

			string text = @"
Q: generate a standard model for Foo
A: public class Foo
     {
         public int Id { get; set; }
         public DateTime DateCreation { get; set; }
         public DateTime? DateModification { get; set; }
         public int User { get; set; }
         public bool Archive { get; set; }
}	
Q: create a standard model for Bar
A: public class Bar
     {
        public int Id { get; set; }
        public DateTime DateCreation { get; set; }
		public DateTime? DateModification { get; set; }
		public int User { get; set; }
		public bool Archive { get; set; }
	}
Q: create a standard model for MyNewModel";

			var textList = text.Split(new[] {"Q:", "A:"}, StringSplitOptions.RemoveEmptyEntries);
			var prompt = string.Join("|endoftext|", textList);

			var result = await api.Completions.CreateCompletionAsync(
				prompt, 1024, 0, null, null, null, null,
				null, null);

			var splits = result.ToString().Split("|endoftext|", StringSplitOptions.RemoveEmptyEntries);
			foreach (var line in splits)
			{
				Console.WriteLine(line);
			}

The problem is that the engine seems to take more time (and cost more) to process almost the same input, and it also does not return any newlines. The output is the following:

|endoftext| public class MyNewModel
     { 		public int Id { get; set; } 		public DateTime DateCreation { get; set; } 		public DateTime? DateModification { get; set; } 		public int User { get; set; } 		public bool Archive { get; set; } 	}
|endoftext|
|endoftext|
|endoftext|
|endoftext|
........... (many more)

The actual generated code is almost what I need, but the newlines are missing. The engine needs to know which part is the input and which is the output for each case, but the docs only mention the |endoftext| separator (I guess it should be used between the cases).
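To see what the code above actually sends, the prompt can be printed after the Split/Join step. The sketch below uses a shortened, made-up version of the prompt and only standard .NET string methods; it does not call the API.

```csharp
using System;

// Shortened stand-in for the prompt used in the code above.
string text = "\nQ: generate a standard model for Foo\nA: public class Foo\n{\n}\nQ: create a standard model for Bar";

// Same transformation as in the code above.
string[] parts = text.Split(new[] { "Q:", "A:" }, StringSplitOptions.RemoveEmptyEntries);
string prompt = string.Join("|endoftext|", parts);

// Show newlines as a literal \n so the structure is visible on one line.
Console.WriteLine(prompt.Replace("\n", "\\n"));
```

The printed prompt no longer contains any "Q:" or "A:" labels: questions and answers are all joined by the same |endoftext| marker, so nothing in the text distinguishes an input from an output. Whether this explains the repeated |endoftext| lines in the result is only a guess on my part.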

I have checked the API examples, but they only contain trivial examples (i.e. a few words on the same line).

Any idea how to send the text correctly to obtain output similar to the playground's?


1 answer

I contacted OpenAI's support, and one way to generate more accurate multi-line responses is to use a clear separator between each question-and-answer block of text. My final working code is the following (### is the separator I used):

string text = 
@"Q: generate a standard model for Foo
A: public class Foo
{
	public int Id { get; set; }
	public DateTime DateCreation { get; set; }
	public DateTime? DateModification { get; set; }
	public int User { get; set; }
	public bool Archive { get; set; }
}	
###
Q: generate a standard model for Bar
A: public class Bar
{
	public int Id { get; set; }
	public DateTime DateCreation { get; set; }
	public DateTime? DateModification { get; set; }
	public int User { get; set; }
	public bool Archive { get; set; }
}
###
Q: generate a standard model for MyNewModel";

// normalizing newlines
text = text.Replace(Environment.NewLine, "\n");

var result = await api.Completions.CreateCompletionAsync(
	text, 1024, 0, 1, null, 0, 0,
	null, null, "###");

Console.WriteLine(result);

The output is:

A: public class MyNewModel
{
    public int Id { get; set; }
    public DateTime DateCreation { get; set; }
    public DateTime? DateModification { get; set; }
    public int User { get; set; }
    public bool Archive { get; set; }
}
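The completion starts with the "A:" label because the prompt ends right after the last question. If only the class is needed, the label can be stripped afterwards. This is a minimal sketch using standard string methods; `StripAnswerLabel` is a hypothetical helper and the sample completion is a shortened version of the output above.

```csharp
using System;

// Hypothetical helper: remove a leading "A:" label (and surrounding
// whitespace) from a completion, leaving the generated code untouched.
static string StripAnswerLabel(string completion)
{
    string trimmed = completion.TrimStart();
    return trimmed.StartsWith("A:", StringComparison.Ordinal)
        ? trimmed.Substring(2).TrimStart()
        : trimmed;
}

// Shortened sample of the output shown above.
string completion = "\nA: public class MyNewModel\n{\n    public int Id { get; set; }\n}";
string code = StripAnswerLabel(completion);

Console.WriteLine(code);
```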

For unknown reasons, I seem to get better formatting when working with the API (correct newlines) than when working with the playground.
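The prompt layout used above (a "Q:" line, an "A:" block, then a ### line) can also be assembled from a list of examples instead of one long literal. The following is a sketch under my own assumptions: `BuildPrompt` is a hypothetical helper, the examples are shortened, and line endings are normalized by replacing "\r\n" directly, since a verbatim string takes its line endings from the source file rather than from Environment.NewLine.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper: builds a few-shot prompt with one separator line
// after each example and the new question at the end.
static string BuildPrompt(
    IEnumerable<(string Question, string Answer)> examples,
    string newQuestion,
    string separator = "###")
{
    var sb = new StringBuilder();
    foreach (var (question, answer) in examples)
    {
        sb.Append("Q: ").Append(question).Append('\n');
        sb.Append("A: ").Append(answer).Append('\n');
        sb.Append(separator).Append('\n');
    }
    sb.Append("Q: ").Append(newQuestion);

    // Normalize Windows line endings so the prompt only contains "\n".
    return sb.ToString().Replace("\r\n", "\n");
}

// Shortened examples; the second one deliberately uses "\r\n".
var examples = new List<(string Question, string Answer)>
{
    ("generate a standard model for Foo",
     "public class Foo\n{\n\tpublic int Id { get; set; }\n}"),
    ("generate a standard model for Bar",
     "public class Bar\r\n{\r\n\tpublic int Id { get; set; }\r\n}"),
};

string prompt = BuildPrompt(examples, "generate a standard model for MyNewModel");
Console.WriteLine(prompt);
```

The resulting string can be passed as the prompt in place of the literal, with the same separator given as the stop sequence.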

Note: this answer was made possible by Adam Rhodes, and it was also posted on OpenAI's community.
