4 Billion online searches per day worldwide (www.comscore.com)
Searches unsuccessful = 20% (J Trant et al, Understanding searches of Online collections)
= 800 million search failures per day
…….with a 10% qualifier to allow for more successful short searches…….
Simulated Gatfol Addition
= 80 million extra successful searches with Gatfol
…..equivalent to the 5th largest standalone search engine in the world…..
Unexpected Gatfol contribution:
….most users avoid long searches after early initial failures….
Gatfol can possibly double the use of longer queries : 10% x 4 billion = 400 million
= plus 80 million searches now successful from previously abandoned searches
= +- 0.5 billion extra Gatfol related successful searches per day
Simulated Gatfol Addition
………equivalent to the 3rd largest standalone search engine in the world……. !
Examples of the length and semantic diversity in what we would like to ask from current search engines:
Real world queries……..we have a problem…
-What is a spade flat head with a worm like body that has black stripes down center of body and moves slow about 2 inches long and 1 centermetre wide the body was all the same length?
-Is there a movie or video that has footage of the deep dark corners of the sea that doesnt have a whole lot going on just some cool looking fish doing their thing?
-Why does an organism that lives on land have more complex mechanisms to maintain a stable internal environment than organisms that live in water?
-There is this roomer going around that says that 7 people have already been eaten at ratanga by the crocidiles at crocidile gorge is it true?
-What are the three terms that are indicative of what kind of food source an animal from which it derives it’s primary of energy supply?
-Which of the following are adaptations to remaining afloat and controlling bouyancy found in many pelagic organisms except?
-What is the asexual reproduction method conducted by starfish in which new organisms can grow from its sections?
-What is the maze of passageways that carries proteins and other materials from one part of the cell to another?
-What are some of the conditions or requirements that distributed database would be better for an organisation?
-Explain what Welty means by the habit of love and tell why this habit might be compared to a worn path?
-Why humans have a higher density of receptors for touch in some areas of skin than in other areas?
-How do put together your bio-ball aquaium filter system i bought used use everything in a a box?
-How does the body’s arrangement of nerve receptors for touch make the Braille system possible?
-Why do the survival of the whole organism depends on the functioning of its individual cells?
– What is the relationship between the carbon in the ocean and the carbon in the soil?
-If the dna sequence is tacgttaccgagctagact then what is the sequence of the messenger rna?
-The effect on ancient cultures and traditions of people of cutting the old-growth forest?
-Why there is more species diversity in southern Florida than there is in northern Alaska?
Actual questions submitted to wiki.answers.com
Gatfol provides massive stealth power in a fraction of a second to enable this level of complicated search input into any data set.
…does adding inverted commas, etc. help?
Boolean operators correlates with WORSE search outcomes….
……in addition – word permutations are so large that we soon get ……
Gatfol is the only possible search-enhance route to go…
TIMELINE PROJECTION TO GATFOL ENVISIONED “REASONED SEARCH”
Humans will “speak to data” semantically in 5-10 years. Gatfol will be exceptionally well placed to provide an integral part of this process as Gatfol Search Technology provides “translation” between human natural language and data. Gatfol enables humans to reason with information.
VOLUME BASED DIAGRAMMATIC PROJECTION TO GATFOL NATURAL LANGUAGE INTELLIGENCE
Unstructured datasets worldwide are growing large and diffuse – keyword searches have become ineffective in returning adequate results. Large search engines currently experience a search failure rate of 20% – 25% for queries of longer lengths, or approximately 900 million online searches per day. Gatfol aims to remedy this.
Who hasn’t felt frustrated by failed long search queries that bring up millions of irrelevant results because the keywords being used are not quite hitting the nail on the head and can’t retrieve the vital information one is looking for? Search engines are supposed to make life easier for us by doing all the dirty work and finding us the best results, but with more and more unstructured data to sift through daily, how can we even hope to find the exact bits of information inside a chaotic and unstructured web? What if we could find a way to make the web more intelligent and use a semantic solution to help us reason and talk-to unstructured data to find what we are looking for more easily? If there was a way to change keyword search into man-to-machine questioning communication, this could dramatically improve page returns on long, complicated search queries.
Say for instance the following search query is input into current search engines:
“What is the name of the small coffee shop with the red shutters behind the train station in the Center of Copenhagen that serves five types of strawberry cheesecake”
If the only relevant reference words on the web are “Maurice’s Deli Copernicus Square”, a word by word hit count gives a zero result. What hasn’t been done and what truly represents the “holy grail” in semantic analysis at this level, is an efficient multi-word to multi-word grammar and semantic transformation. This will make limited keyword search of today, yesterday’s technology.
It is highly likely that humans will “converse with data” semantically in the next 5-10 years. Research is currently being done that would help to facilitate “translation” between search language and data. This is a developing technology that endeavours to enhance the ability to deal with the informality and messiness of everyday verbal communication in its attempt to make more sense of unstructured data.
What would be hugely beneficial to society would be to make this semantic intelligence technology available to the masses. The current development of “semantic tools” could make a huge difference to how the public communicates with the web in future, facilitate communication on the social web and help to improve the extraction of useful data from a very chaotic and complex world of data.
According to an article written for the Economist (Feb 27th 2011), “Data, Data Everywhere”, it is said that while the exposure to abundant data brings huge benefits to society, it also brings a lot of frustrating problems and big headaches. Manipulating and using data for the greater good of society over time is most definitely a worthy cause. An even greater challenge is to not only find a way to enable humans to talk to data on the web, but in accomplishing that, to open the doors to exciting new ways of applying that technology across other data collections more broadly in future.