010-82115891/5892 021-31200158
Chinese Translation Samples More
¡¤ Jack Welch Lexicon of Leader..
¡¤ Research Report from UK'..
¡¤ Research Report from UK'..
¡¤ Retinal Prosthetic Systems f..
¡¤ Retinal Prosthetic Systems f..
¡¤ Jack Welch Lexicon of Leader..
¡¤ Jack Welch Lexicon of Leader..
¡¤ Research Report from UK'..
¡¤ Research Report from UK'..
¡¤ Neuroprosthetics-Frontiers o..
Chinese Translation Achievements More
¡¤ French Chinese Translation
¡¤ Russian Chinese Translation
¡¤ German Chinese Translation
¡¤ Japanese Chinese Translation
¡¤ Spanish Chinese Translation
¡¤ Italian Chinese Translation
¡¤ Korean Chinese Translation
¡¤ Portuguese Chinese Translation
¡¤ Ukrainian Chinese Translation
¡¤ Arabic Chinese Translation
Chinese Translation Samples > Communications English to Chinese Sample

Searching for Statistical Diagrams_Frontiers of Engineering 2011: Reports on Leading-Edge Engineering from the 2011 Symposium_English to Chinese_English Source_20120027-8

Searching for Statistical Diagrams
Shirley Zhe Chen, Michael J. Cafarella, and Eytan Adar
University of Michigan
INTRODUCTION
Statistical, or data-driven, diagrams are an important method for communicating complex information. For many technical documents, the diagrams may be readers¡¯ only access to the raw data underlying the documents¡¯ conclusions.
Unfortunately, finding diagrams online is very difficult using current search systems. Standard text-based search will only retrieve the diagrams¡¯ enclosing documents. Web image search engines may retrieve some diagrams, but they generally work by examining textual content that surrounds images, thus missing out on many important signals of diagram content (Bhatia et al., 2010; Carberry et al., 2006). Even the text that is present in diagrams has meaning that is hugely dependent on their geometric positioning within the diagram¡¯s frame; a number in the caption means something quite different from the same number in the x-axis scale (Bertin, 1983).
There has been growing commercial interest in making data-driven diagrams more accessible, with data search systems such as SpringerImages (http://www.springerimages.com/ ) and Zanran (http://www.zanran.com/q/ ). While there is a huge amount of research literature on search and image-related topics, diagram search per se is largely unexplored.
In this paper we propose a Web search engine exclusively for data-driven diagrams. As with other Web search engines, our system allows the user to enter keywords into a text box in order to obtain a relevance-ranked list of objects. Our system addresses several challenges that are common among different search engines but that require solutions specifically tailored for data-driven diagrams.
Diagram Corpus Extraction
Obtaining the text of a Web document is usually as easy as downloading and parsing an HTML file; in contrast, statistical diagrams require special processing to extract useful information. They are embedded in PDFs with little to distinguish them from surrounding text, the text embedded in a diagram is highly stylized with meaning that is very sensitive to the text¡¯s precise role, and, because diagrams are often an integral part of a highly engineered document, they can have extensive ¡°implicit hyperlinks¡± in the form of figure references from the body of the surrounding text. Our Diagram Extractor component attempts to recover all of the relevant text for a diagram and determine an appropriate semantic label (caption, y-axis label, etc.) for each string.
Ranking Quality
All search engines must figure out how to score an object¡¯s relevance to a search query, but scoring diagrams for relevance can yield strange and surprising results. We use the metadata extracted from the previous step to obtain search quality that is substantially better than naive methods.
Snippet Generation
Small summaries of the searched-for content, usually called snippets, allow users to quickly scan a large number of results before actually selecting one. Conventional search engines select regions of text from the original documents, while image search engines generally scale down the original image to a small thumbnail. Neither technique can be directly applied to data-driven diagrams.
Obviously, textual techniques will not capture any visual elements. Figure 1 shows that image scaling is also ineffective: although photos and images remain legible at smaller sizes, diagrams quickly become difficult to understand.
This paper describes DiagramFlyer, a search engine for finding data-driven diagrams in Web documents. It addresses each of the above challenges, yielding a search engine that successfully extracts diagram metadata in order to provide both higher-quality ranking and improved diagram ¡°snippets¡± for fast search result scanning.
The techniques we propose are general and can work across diagrams found throughout the Web. However, in our current testbed we concentrate on diagrams extracted from PDFs that were discovered and downloaded from public Web pages on academic Internet domains. Our resulting corpus contains 153,000 PDFs and 319,000 diagrams. We show that DiagramFlyer obtains a 52% improvement in search quality over naive approaches. Furthermore, we show that DiagramFlyer¡¯s hybrid snippet generator allows users to find results 33% more accurately than with a standard image-driven snippet. We also place DiagramFlyer¡¯s intellectual contributions in a growing body of work on domain-independent information extraction¡ªtechniques that enable retrieval of structured data items from unstructured documents, even when the number of topics (or domains) is unbounded.
 
Ô­¼þÏÂÔØ£º
Main Languages More
Reliable Cantonese Translations
Simplified Chinese Translation
Traditional Chinese Translation
English translation
German Translations
French
Professional Scope More
¡¤ Multilingual Solurtions For ..
¡¤ Government And International..
¡¤ Energy Sector Multilingual S..
¡¤ Telecommunications Multiling..
¡¤ IT Multilingual Solutions
¡¤ Language Solutions For The M..
¡¤ Law Firms
¡¤ Banking and Finance
Chinese Translators More
¡¤ Ms. Lou: French-Chinese trans..
¡¤ Ms. Duan: French-Chinese tran..
¡¤ Ms.Wang: French-Chinese trans..
¡¤ Mr. Jin: professional French-..
¡¤ Ms. Wang: French-Chinese tran..

Beijing Address: Room 1507, Building 4, Sun Garden, Haidian District, Beijing. Post Code: 100098
Tel: +86-10-82115891 Fax: +86-10-82115892 Email:beijinghyw@126.com MSN:bjhyw@hotmail.com

Shanghai Address: 20G of No. 38 of Caoxi North Road, Shanghai.Post code: 200030
Tel: 0086-21-31200158 Fax: 0086-21-31200158 Email:shkehu@263.net

Copyright 2007 www.readworld.com All rights reserved