TextTeaser Open Sourced
Written by Lucy Black   
Tuesday, 15 October 2013

TextTeaser, an application that uses Natural Language Processing and Machine Learning to automatically summarize text, has been open sourced.

The best way to appreciate the effectiveness of TextTeaser is to try it. You can either copy and paste some text into the window it provides or just enter the URL of an article - feel free to try it with one of our news items.

 

 

Once it has speedily created a point-by-point summary, you can share a link to it, either as text or as an image, or you can use the embed code to add it to a webpage.

 


As its co-creator Jolo Balbin explained in an interview for Programmable Web, he started work on the algorithm as part of his graduate studies and is very keen to turn it into a real-world application. As part of his research he ran tests to compare his algorithm with others that perform automatic summarization and found that it performed better. As it uses machine learning in combination with NLP (Natural Language Processing), its performance will improve as more articles are summarized.

TextTeaser's API and its documentation are now on Mashape, with both Freemium and subscription options. For those who want to get involved with the project, the code is on GitHub under an MIT license. It requires the sbt build tool, and use of the Scala IDE is recommended.

 


More Information

TextTeaser

TextTeaser on GitHub

TextTeaser API on Mashape

Related Articles

Handbook of Natural Language Processing (2e)       

Google Helps Tell An Apple From Apple       

Microsoft Web N-gram Services go public       

Grants Awarded To Kivy and NLTK To Boost Python 3

Strongsteam - an online AI API 

 


 

Last Updated ( Tuesday, 15 October 2013 )