Will it be clunky? Yes. Does it work? Also yes.
Today I'm going to walk you through how to create a short transcription exercise in Otus. Here are the graphics I am going to use today:
It may help to include the note name as part of each graphic. You might need to play with the size of the images a bit to get this to work. (Since the original post I've updated the graphics to be all the same size - you can get the full sets on this post: notation graphics.)
To begin, create your question in the Item Bank. You will want to use the "Cloze with drag & drop" option under Fill in the Blanks (Cloze). Next, delete all the example text in the Template markup. Using the image button, add the treble clef to the markup. Set the clef image to prevent scaling so it doesn't throw off the size of the other images.
We're almost done!
We need to change a few of the options, so scroll to the bottom and click on "More options". Under scoring type, make sure "Exact Match" is selected. You can leave spaces blank in the answer key - so if you add a half note and then leave the next beat blank, that will still be marked correct when students check their answer!
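If it helps to picture what "Exact Match" means with a blank beat, here is a small Python sketch. This is not how Otus implements scoring; the function name, the note labels, and the use of `None` for an empty container are all my own assumptions for illustration.

```python
from typing import Optional, Sequence


def exact_match(response: Sequence[Optional[str]],
                key: Sequence[Optional[str]]) -> bool:
    """Return True only if every container matches the answer key.

    None stands for a container that was left blank (an assumption
    made for this sketch, not something taken from Otus).
    """
    if len(response) != len(key):
        return False
    return all(r == k for r, k in zip(response, key))


# Hypothetical answer key: a half note, a deliberately blank beat,
# then two quarter notes.
key = ["half_G", None, "quarter_A", "quarter_B"]

# The student leaves the second container empty, matching the key.
print(exact_match(["half_G", None, "quarter_A", "quarter_B"], key))

# The student drops a note into the container that should be blank.
print(exact_match(["half_G", "quarter_G", "quarter_A", "quarter_B"], key))
```

The first call prints `True` and the second prints `False`: under this reading, a blank only counts as correct when the key is blank in the same spot.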
You can change where the note bank appears with the Response container position setting under Layout. I like to put mine above, but as long as you are consistent it should be fine wherever you put it. Under Response Container (global) set the width to 55px and the height to 250px. That will make sure the containers are big enough for the images - the containers themselves can grow with the image, but this way they appear the right size to begin with. If you are using this for practice, you can select the check answer box and Unscored/Practice under Scoring.
Finally, hit the save button at the top of the page! If you haven't yet titled the question, you will be prompted to do that before you leave the question.
Here is what our question looks like: