Suggestion - REQUEST! Dynamic Text Behavior

  • 1
  • Idea
  • Updated 4 months ago
  • (Edited)
I have studied four major video editing packages over 20 years, and I keep coming back to Camtasia because of how easy it is to do what I want to. However, one of the biggest needs in video editing today, and something missing from ALL the packages, is the ability to add DYNAMIC TEXT (there's another name for this, but I can't think of it just now). What I mean is text that appears word-by-word, as the narrator speaks the words.

This might not be easy, but it's something I feel confident the Camtasia Engineers could build, and could add to the Behaviors repertoire. Here are the details, but please feel free to modify them as needed:

1. The user adds a text clip over another video (or audio) clip.
2. Then add the Dynamic Text behavior to that text clip.
3. Next, type in the text for that clip and format it on the page the way it should appear in the final rendering. This can include size, color, font, line breaks, positioning, etc.: whatever is needed to make it fit in with whatever else is on the page and into the style of the video.
4. Next, open the properties panel, on which will be a DYNAMIC TEXT tab, where we find the RECORD controls and directions.
5. When the user clicks the RECORD control (the text clip is highlighted), the playhead will be positioned 3 to 5 seconds before the text clip, and the text on the clip will appear in a "disabled" color (defaults to medium grey but should be selectable), and then playback will begin automatically. (Text appears in the disabled color only during record operations.)
6. Following instructions that are already present on the Properties panel, the user will tap the trigger key (possibly "T" for text, as long as it doesn't conflict with other hotkeys) at the moment they want each next word to appear. Each time the user presses the hotkey, the next word changes from the disabled color to the primary color.
7. When the playhead reaches the end of the text clip, playback will automatically stop and the timing for the appearance of each word will be recorded with that text clip.
8. Steps 5 through 7 can be repeated if needed, to correct mistimings. Even better would be the ability to see and move special markers on the text clip that show where each word appears.
9. Other behaviors may be optionally added to (each word on) the text clip, such that as the words appear, they can "slide" in with a "bounce." The user should be able to set the default behavior to be applied each time they use the Dynamic Text behavior.
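
The record-and-tap steps above can be sketched in a few lines of Python. This is only an illustration of the idea, not anything Camtasia provides: `WordCue`, `assign_tap_times`, and `visible_words` are hypothetical names, and the sketch assumes tap times are measured in seconds from the start of the text clip.

```python
from dataclasses import dataclass


@dataclass
class WordCue:
    word: str
    start: float  # seconds from the start of the text clip (assumed unit)


def assign_tap_times(text, tap_times, clip_duration):
    """Pair each word with the tap that revealed it (hypothetical helper)."""
    words = text.split()
    # Ignore taps outside the clip and keep the rest in chronological order.
    taps = sorted(t for t in tap_times if 0.0 <= t <= clip_duration)
    cues = [WordCue(word, t) for word, t in zip(words, taps)]
    # Words that never got a tap appear when the clip ends (step 7).
    for word in words[len(cues):]:
        cues.append(WordCue(word, clip_duration))
    return cues


def visible_words(cues, playhead):
    """Return the words already revealed at a given playhead time."""
    return [c.word for c in cues if c.start <= playhead]


cues = assign_tap_times("Hello brave new world", [0.4, 0.9, 1.5], clip_duration=3.0)
print(visible_words(cues, 1.0))  # ['Hello', 'brave']
```

Re-recording (step 8) would simply call `assign_tap_times` again with a fresh list of taps; movable markers would amount to editing the `start` value of individual cues.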

Currently, there is no way to add this behavior in any major package, from Premiere Pro to After Effects to Camtasia to DaVinci Resolve to Final Cut Pro to PowerDirector.

Adding this capability to Camtasia would propel them even farther ahead of the competition.


dpm39560

  • 3 Posts
  • 0 Reply Likes
  • HOPEFUL! We need this!

Posted 4 months ago

  • 1

Patrick Gray

  • 2 Posts
  • 1 Reply Like
Hello dpm39560.... I agree on the concept of a narration editor that has voice-to-text processing in it. It seems you are saying it would be nice to alter the form (color, font, size, etc.) of the text in your video and have it display as your video is playing. I had the issue of background noise in my videos, and I was surprised how mentally challenging it is to get your video actions and transitions looking good and making your point while at the same time getting your audio "just right." Anyway, I used Camtasia to create a set of YouTube videos where I describe an add-on for TechSmith to create. I thought you might enjoy them. Here's the URL for Part 01 of my video set. https://www.youtube.com/watch?v=7kVlaXdYzWg&t=3s

Rick Stone

  • 6303 Posts
  • 2997 Reply Likes
Hmmm, I know it's not text, but I'm a big fan of interesting things. Since Halloween just passed, I was recently enjoying listening to Toccata and Fugue in D minor. I'm not sure what was used for this, but there is a very interesting visualization that was used here. One that makes me think it would be very similar to what you are asking about.



Rick Stone

  • 6303 Posts
  • 2997 Reply Likes
Hmmm, it seems there is information out there for this. What was used is called the "Music Animation Machine".

Ed Covney

  • 702 Posts
  • 397 Reply Likes
I wonder how Al Yankovic did it:  https://www.youtube.com/watch?v=8Gv0H-vPoDc

Rick Stone

  • 6303 Posts
  • 2997 Reply Likes
Painstakingly? LOL

tom

  • 4 Posts
  • 3 Reply Likes
There is a feature on YouTube that tries to interpret speech and attaches an XML file with millisecond timings to your video instance for use as closed captions. Inevitably you will need to correct the flubs, and I don't think you can use this outside the youtube.com domain (you can embed the video, but without the XML). Basically you are talking about a SCORM/LMS setup, but I don't see how automated text interpretation will work without some significant editing and possibly timing tweaks. There's also the question of visual organization by number of rows, etc. Sometimes you want to leave pauses or speak quickly. Even OCR interpretations, which have been around for over 30 years, screw things up a good deal.

Rick Stone

  • 6303 Posts
  • 2997 Reply Likes
So that's how it's done! I clicked a link earlier in this thread and watched the video and clicked the CC to view. I noticed the CC words appearing one by one.

kayakman, Champion

  • 7369 Posts
  • 2515 Reply Likes
just curious ... is this anything like you're looking for?

Example Dynamic Text 2019-11-03
https://www.screencast.com/t/e1bQJNR4CY


kayakman, Champion

  • 7369 Posts
  • 2515 Reply Likes
or perhaps ...

Example Dynamic Text Using Callout (Top) Caption (Bottom) 2019-11-03
https://www.screencast.com/t/W7jdwScvNs


Ed Covney

  • 702 Posts
  • 397 Reply Likes
Excellent!! Now one more favor, how did you do that?

Joe Morgan

  • 8058 Posts
  • 4354 Reply Likes
Simple, create a line of text. Then use animated cropping to expose it.

Ed Covney

  • 702 Posts
  • 397 Reply Likes
Joe - I thought "dynamic" meant speech-to-text?

kayakman, Champion

  • 7369 Posts
  • 2515 Reply Likes
Joe is correct re the callout version; the animations are instantaneous

the captions version was made using sequential captions, timed to narration, where the extended text is shown as narration advances
(Edited)
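
For anyone who wants to try the sequential-captions approach from a list of word timings, here is a minimal Python sketch that writes cumulative captions in the SubRip (SRT) layout. The function names and the sample timings are made up for illustration, and it assumes your editor can import an SRT file; each caption repeats the words spoken so far and adds the newest one.

```python
def srt_timestamp(seconds):
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def cumulative_srt(word_times, end_time):
    """Build SRT text where each caption shows all words spoken so far.

    word_times is a list of (word, start_seconds) pairs in spoken order.
    """
    blocks = []
    for i, (_, start) in enumerate(word_times):
        # Each caption lasts until the next word starts, or until end_time.
        stop = word_times[i + 1][1] if i + 1 < len(word_times) else end_time
        text = " ".join(w for w, _ in word_times[: i + 1])
        blocks.append(
            f"{i + 1}\n{srt_timestamp(start)} --> {srt_timestamp(stop)}\n{text}\n"
        )
    return "\n".join(blocks)


# Hypothetical timings, in seconds from the start of the video.
print(cumulative_srt([("Hello", 0.4), ("world", 0.9)], end_time=2.0))
```

Long passages would still need to be split into separate caption groups by hand so the text does not run off the screen.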

tom

  • 4 Posts
  • 3 Reply Likes
If you turn text into SVG objects, a lot of programs can do this. I thought the question was about generating closed captions or text directly from recorded speech. I've only seen that on YouTube, but there may be other vendors. I don't think Camtasia or Captivate can do this. In fact, it's very hard to find good software just to transcribe a .wav file into a text file. The beta apps on mobile phones are about 70-80% accurate but are not real time. I would have thought a company like Nuance would be the place to look.
(Edited)

kayakman, Champion

  • 7369 Posts
  • 2515 Reply Likes
Camtasia does have a speech-to-text function; look at captions panel, tools ...

Kelly Rush, Product Manager

  • 355 Posts
  • 75 Reply Likes
Hi dpm39560,

Are you talking about something like an animated Kinetic Typography style of video? Here's an example (I haven't screened it for content past the first few seconds, so if there's anything offensive in it, please let me know and I'll find a different example).

https://www.youtube.com/watch?v=J7E-aoXLZGY

Cheers,

Kelly Rush
Product Manager, Camtasia

Patrick Gray

  • 2 Posts
  • 1 Reply Like
Hello Kelly, that is close to what I wanted. I am still suggesting that a lot of people would prefer a system where Camtasia displays its text version of what it thought they said as a running line of text, with a vertical line superimposed on the text stream. That vertical line would be the time counter: it shows where each word falls (at least in Camtasia's opinion) on the timeline of the video in question.

I know others in this suggestion stream are working on altering the appearance of the text, but I wanted to be able to watch that line of text go by and re-dictate another audio version of what I said, so that the new audio is "clean": each word spoken clearly, with no background noise. Since the re-dictation follows the timing of the original words, the project would then have two narration tracks: the original one that was not very pleasing, and the one I just re-dictated.

Remember, since Camtasia showed the original narration as a line of text with an indicator line showing time, the re-dictated narration should be able to go into the video directly, because all the words would already be properly synchronized. So.... you just select the original video, select the new narration that sounds really good, un-select the original narration that had errors and background noise, and re-render. Now it's a nice mp4 with nice video and nice narration. I showed my suggested version of how Camtasia could create a workable version of all this in my YouTube videos. pg
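
To make the "vertical line over a running line of text" idea concrete, here is a rough Python sketch of the lookup it would need: given per-word start times from a transcript, find the word under the playhead. The names `word_at` and `running_line` and the sample transcript are hypothetical, and this says nothing about how Camtasia stores its speech-to-text results.

```python
from bisect import bisect_right


def word_at(word_times, playhead):
    """Return the index of the word being spoken at `playhead`, or None.

    word_times is a list of (word, start_seconds) pairs sorted by start time.
    """
    starts = [t for _, t in word_times]
    i = bisect_right(starts, playhead) - 1
    return i if i >= 0 else None


def running_line(word_times, playhead):
    """Render the transcript with a '|' marker after the current word."""
    i = word_at(word_times, playhead)
    words = [w for w, _ in word_times]
    if i is None:
        return "| " + " ".join(words)
    return " ".join(words[: i + 1] + ["|"] + words[i + 1:])


# Hypothetical transcript timings, in seconds on the video timeline.
transcript = [("the", 0.0), ("quick", 0.5), ("fox", 1.2)]
print(running_line(transcript, 0.7))  # the quick | fox
```

A re-dictation aid would redraw this line as the playhead moves, so the narrator can speak each word as the marker reaches it.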