
180 Park Ave - Building 103
Florham Park, NJ
Enhancing Social TV through Social Media Mining
Junlan Feng, Bernard Renger, Ovidiu Dan
TV Content Analysis,
2012.
[PDF]
[BIB]
{This chapter will give an overview of recent media mining technologies and their applica-
tions to TV-related services. In particular, we will cover three main themes related to Social
TV. First, we will discuss retrieving social media content related to TV program. Some TV
shows such as "Lost" and "Heroes" have titles that are also common terms. Using these
titles as keywords to search for relevant messages will lead to low precision. Conversely,
many shows have long titles, which results in low recall as users don't use the full name
of the shows. Appropriately retrieved social media content can be used to display social
discussions on certain TV shows. Furthermore, retrieving such data with high precision and
recall is crucial to a wide spectrum of applications and data mining algorithms.
The second theme is social media mining for TV services. We will focus on sentiment
analysis and detection of in
uential users in social networks. Analyzing messages for sen-
timent can help uncover the aggregate opinion of a large number of users with respect to
certain TV shows or episodes. It can also be useful in presenting opposing views on the same
subject. Graph analysis can be used to determine the most in
uential users which discuss
certain TV programs.
The third theme is integrating the information mined from social media into Social TV
interfaces. This section describes the technologies that enable a user to interact with social
media through television sets. We will present VoiSTV, a voice-enabled social TV prototype
system created at AT&T Labs Research. We will also discuss previous work in this area, as
well as trends in the industry.}

Voice-Enabled Social TV
Bernard Renger, Junlan Feng, Ovidiu Dan, Hisao Chang, Luciano Barbosa
WWW2011,
2011.
[PDF]
[BIB]
ACM Copyright
The definitive version was published in WWW 2011. , 2011-03-28
{Until today, the TV viewing experience has been very unsocial compared to the World Wide Web. In this demo, we will present a Voice-enabled Social TV system, VoiSTV, which allows users to access twitter through the TV using voice. With this application, a user can receive and send twitter messages (tweets) through the TV while watching TV. Users can input tweets to be sent using spoken language. Beyond accessibility, VoiSTV also provides users metadata information about TV shows such as trends, hot topics, popularity of the show as well as aggregated sentiment of show-related tweets. }
Listen to Me if You can: Tracking user experience of mobile network on social media
Jia Wang, Zihui Ge, Jennifer Yates, Junlan Feng, Jun Xu, Tongqing Qiu
ACM/USENIX Internet Measurement Conference,
2010.
[PDF]
[BIB]
ACM Copyright
(c) ACM, 2010. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution.
The definitive version was published in ACM Internet Measurement Conference , 2010-11-01.
{Social media sites like twitter continue to grow at a fast pace.
People of all generations use social media to exchange messages
and share experiences of their life in a timely fashion.
Most of these sites make their data available. An intriguing
question is can we exploit this real-time and giant data-flow
to improve business in a measurable way. In this paper, we
are particularly interested in tweets (twitter messages) that
are relevant to mobile network performance. We compare
tweets with traditional source of user experience, i.e. customer
care tickets, and correlate both of them with network
incident reports. From our study, we have the following observations.
First, twitter users and users who call customer
service tend to report different types of performance issues.
Second, users on twitter are more accurate and faster to report
network problems that impact user experiences. Third,
tweets can show some short term performance impairments,
which are not recorded in incidents reports. These observations
prove that twitter a complimentary source for monitoring
network performance and their impact on user experiences.}
Relevance Recognition For A Human Machine Dialog System Contextual Question Answering Based On A Normalization Of The Length Of The User Input,
Tue Jun 19 12:52:30 EDT 2012
Disclosed are systems, methods and computer-readable media for controlling a computing device to provide contextual responses to user inputs. The method comprises receiving a user input, generating a set of features characterizing an association between the user input and a conversation context based on at least a semantic and syntactic analysis of user inputs and system responses, determining with a data-driven machine learning approach whether the user input begins a new topic or is associated with a previous conversation context and if the received question is associated with the existing topic, then generating a response to the user input using information associated with the user input and any previous user input associated with the existing topic.
Method And Apparatus For Detecting And Extracting Information From Dynamically Generated Web Pages,
Tue Apr 17 12:51:36 EDT 2012
A method and apparatus for automatically detecting and extracting information from dynamically generated web pages are disclosed. For example, the present method stores user provided information that is entered into a form interface of a web page for a first query. Responsive to the first query, a first response web page is received and stored. The present method then automatically generates a second query to acquire a second response web page that is responsive to the second query. Finally, the present method compares the first response web page and the second response web page. In one embodiment, the present invention extracts information that is dissimilar between the first response web page and the second response web page. This extracted information is deemed to be the pertinent information requested by the user.
System And Method Of Automatically Generating Building Dialog Services By Exploiting The Content And Structure Of Websites,
Tue Jan 03 12:50:00 EST 2012
A method and system are disclosed for providing a dialog interface for a website. The method comprises at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.
Automated Call Router For Business Directory Using The World Wide Web,
Tue Jan 03 12:50:00 EST 2012
The embodiments include a system and a method for establishing a communication connection after searching the World Wide Web. The system can include a first communication device for forming at least one communication connection between the first communication device and a second communication device, search means adapted to accept a query, access means can be adapted to search and identify relevant phone number information using the query, and create at least one icon to link the first communication device to a relevant phone number included in the relevant phone number information identified by the query, and reformulate the query if no relevant phone numbers are identified during the search. The system also includes click-to-dial means adapted to establish at least one communication connection from the first communication device to the second communication device.
System And Method Of Automatically Building Dialog Services By Exploiting The Content And Structure Of Websites,
Tue Nov 22 16:06:37 EST 2011
A method and system are disclosed for providing a dialog interface for a website. The method comprises at each node in a website, computing a summary, a document description and an alias. A dialog manager within a spoken dialog service utilizes the summary, document description and alias for each website node to generate prompts to a user, wherein nodes in the website are matched with user requests. In this manner, a spoken dialog interface to the website content and navigation may be generated automatically.
System And Method Of Providing A Spoken Dialogue Interface To A Website,
Tue Nov 15 16:06:32 EST 2011
Disclosed is a system and method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes converting data from a structured database associated with a website to a structured text data set and a structured task knowledge base, extracting linguistic items from the structured database, and training a spoken dialog service component using at least one of the structured text data, the structured task knowledge base, or the linguistic items. The system includes modules configured to implement the method.
Automated Call Router For Business Directory Using The World Wide Web,
Tue Jul 26 16:05:47 EDT 2011
The embodiments include a system, a computer readable medium, and a method for establishing a communication connection after searching the World Wide Web for relevant phone information. The system can include a first communication device for forming at least one communication connection between the first communication device and a second communication device, search means adapted to accept a query, access means adapted to (i) search and identify relevant phone number information using the query (ii) create at least one icon to link the first communication device to a relevant phone number included in the relevant phone number information identified by the query, and (iii) reformulate the query if no relevant phone numbers are identified during the search. The system also includes click-to-dial means adapted to establish at least one communication connection from the first communication device to the second communication device.
Method And Apparatus For Building Sales Tools By Mining Data From Websites,
Tue May 24 16:05:13 EDT 2011
A website mining tool is disclosed that extracts information from, for example, a company's website and presents the extracted information in a graphical user interface (GUI). In one embodiment, web pages from a website are stored in, for example, computer memory and a structure of the web pages is identified. A plurality of blocks of information is then extracted as a function of this structure and a category is assigned to each block of information. The elements in the blocks of information are then displayed, for example to a salesperson, as a function of these categories. In another embodiment, Document Object Modeling parsing is used to identify the structure of the web pages. In yet another embodiment, a support vector machine is used to categorize each block of information.
Method And Apparatus For Detecting And Extracting Information From Dynamically Generated Web Pages,
Tue Jan 25 16:04:24 EST 2011
A method and apparatus for automatically detecting and extracting information from dynamically generated web pages are disclosed. For example, the present method stores user provided information that is entered into a form interface of a web page for a first query. Responsive to the first query, a first response web page is received and stored. The present method then automatically generates a second query to acquire a second response web page that is responsive to the second query. Finally, the present method compares the first response web page and the second response web page. In one embodiment, the present invention extracts information that is dissimilar between the first response web page and the second response web page. This extracted information is deemed to be the pertinent information requested by the user.
System And Method Of Identifying Web Page Semantic Structures,
Tue Aug 24 15:04:32 EDT 2010
The disclosure presents a method, system and computer-readable medium related to automatically analyzing structure for a web page. The method embodiment comprises building a training corpus comprising a broad stylistic coverage of web pages, segmenting a web page into information blocks, identifying semantic categories of the information blocks using the training corpus and applying the identical semantic categories in a web-based tool.
Method And Apparatus For Automatically Building Conversational Systems,
Tue Feb 09 15:03:30 EST 2010
A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.
System And Method Of Providing A Spoken Dialogue Interface To A Website,
Tue Aug 25 16:07:51 EDT 2009
Disclosed is a system and method for generating a spoken dialog service from website data. Spoken dialog components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. These components are capable of being automatically trained from processed website data. A website analyzer converts a website into structured text data set and a structured task knowledge base. The website analyzer further extracts linguistic items from the website data. The dialog components are automatically trained from the structured text data set, structured task knowledge base and linguistic items.
System and method of providing a spoken dialog interface to a website,
Tue May 13 18:12:49 EDT 2008
Disclosed is a system and method for generating a spoken dialog service from website data. Spoken dialog components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a test-to-speech module. These components are capable of being automatically trained from processed website data. A website analyzer converts a website into structured text data set and a structured task knowledge base. The website analyzer further extracts linguistic items from the website data. The dialog components are automatically trained from the structured text data set, structured task knowledge base and linguistic items.