Photo Caption AI Model from IBM Researchers Tries Not to Sound Too Robotic

IBM AI researchers have created a photo caption AI model which works to caption images without coming off as too robotic or inaccurate.

June 27, 2019 Adam Forziati

Perhaps IBM's photo caption AI could have come up with a better bit of text here?

Rare is the chance for us editors to share with you what it is really like to do our jobs. But a recent article from Venture Beat on photo caption AI caught our attention — and we found something in it we can really relate to.

Something you probably wouldn’t know unless you’re a journalist is that we editors constantly have to come up with captions for images. It may seem like a small thing to complain about, but when we’re already writing and rewriting so many things all day, coming up with yet another way to rehash what we already said, just so a picture has a label, can become monotonous.

Fortunately for us lazy editors, IBM AI is here to help.

Photo caption AI could create “humanlike” captions

According to the Venture Beat article, a research paper presented at the 2019 Conference on Computer Vision and Pattern Recognition (CVPR) by a team of IBM AI researchers describes a model that could craft “diverse, creative, and convincingly humanlike captions.”

“Architecting the system required addressing a chief shortcoming of automatic captioning systems: sequential language generation resulting in syntactically correct — but homogeneous, unnatural, and semantically irrelevant — structures. The coauthors’ approach gets around this with an attention captioning model, which allows the captioner to use fragments of scenes in the photos it’s observing to compose sentences. At every generating step, the team’s AI model has the choice of attending to either visual or textual cues from the last step.” — original Venture Beat article
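For readers who want a feel for what "attending to either visual or textual cues" can look like, here is a minimal sketch. It is not the IBM team's model: it assumes plain dot-product attention over image regions plus a simple sigmoid gate, and the names `attend`, `mix_context`, and the toy vectors are hypothetical, chosen only for illustration.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, region_features):
    """Dot-product attention over image regions (an assumed, generic form).

    Returns the attention weights and the weighted-average visual context.
    """
    scores = [sum(q * f for q, f in zip(query, region)) for region in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    context = [
        sum(w * region[i] for w, region in zip(weights, region_features))
        for i in range(dim)
    ]
    return weights, context

def mix_context(gate_logit, visual_context, textual_context):
    """Blend visual and textual cues with a sigmoid gate between 0 and 1.

    A gate near 1 leans on the image; a gate near 0 leans on the text so far.
    """
    gate = 1.0 / (1.0 + math.exp(-gate_logit))
    return [gate * v + (1.0 - gate) * t for v, t in zip(visual_context, textual_context)]

# Toy example: three image regions, each described by a 2-number feature.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]              # hypothetical decoder state at this step
weights, visual = attend(query, regions)
mixed = mix_context(0.0, visual, [0.0, 0.0])  # gate of 0.5: equal blend
```

In a real captioner the gate and the query would be learned, and the blended context would feed the next word prediction; here they are fixed numbers so the arithmetic is easy to follow.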

But the IBM AI researchers wanted to ensure that the captions didn’t sound robotic, so they used generative adversarial networks (GANs): two-part neural networks in which a generator produces samples and a discriminator attempts to distinguish the generated samples from real examples.
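The tug-of-war between the two parts can be sketched with the standard binary cross-entropy losses used in many adversarial setups. This is an illustration under that assumption, not the loss from the IBM paper, and the function names and scores are hypothetical. Each score is the discriminator's estimated probability that a caption was written by a human.

```python
import math

def discriminator_loss(real_scores, fake_scores):
    """Low when human captions score near 1 and generated captions score near 0."""
    real_term = -sum(math.log(p) for p in real_scores) / len(real_scores)
    fake_term = -sum(math.log(1.0 - p) for p in fake_scores) / len(fake_scores)
    return real_term + fake_term

def generator_loss(fake_scores):
    """Low when the discriminator mistakes generated captions for human ones."""
    return -sum(math.log(p) for p in fake_scores) / len(fake_scores)

# Hypothetical scores for two human captions and two generated captions.
sharp = discriminator_loss([0.9, 0.8], [0.1, 0.2])    # discriminator is winning
confused = discriminator_loss([0.5, 0.5], [0.5, 0.5]) # discriminator is guessing
```

Because captions are made of discrete words, passing this signal back to a caption generator takes extra machinery that the sketch leaves out.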

This means the photo caption AI is trained during the captioning process.

Another discriminator scores the “naturalness” of sentences with a model that matches parts of the image with the generated words, allowing the AI to judge the image and sentence as a pair.
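One simple way to picture scoring an image and a sentence as a pair is to match each word against its best-fitting image region. The sketch below assumes words and regions already live in the same vector space and uses cosine similarity; the scoring rule, the name `pair_score`, and the toy vectors are all hypothetical and are not taken from the researchers' model.

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def pair_score(word_vectors, region_vectors):
    """Average, over words, of each word's best match among the image regions."""
    best = [max(cosine(w, r) for r in region_vectors) for w in word_vectors]
    return sum(best) / len(best)

# Toy image with two regions, and two candidate captions as word vectors.
regions = [[1.0, 0.0], [0.0, 1.0]]
matching_caption = [[0.9, 0.1], [0.1, 0.9]]   # words that line up with the regions
unrelated_caption = [[-1.0, -1.0]]            # a word pointing away from both regions

good = pair_score(matching_caption, regions)
bad = pair_score(unrelated_caption, regions)
```

A caption whose words line up with what is in the picture scores higher than one that does not, which is the kind of signal a pair-judging discriminator could hand back to the captioner.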


Researchers say their photo caption AI achieves “good” performance overall. According to the article, they believe their work paves the way for new computer vision systems, which they also wish to explore.

 
