<?xml version="1.0" encoding="utf-8"?><!-- generator="wordpress/2.0.2" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments for OpenIndex</title>
	<link>http://www.openindex.org</link>
	<description>Whose Internet is it , anyway?</description>
	<pubDate>Tue, 08 Jul 2008 21:46:45 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.2</generator>

	<item>
		<title>Comment on EPIC 2014 by corb</title>
		<link>http://www.openindex.org/2004/12/05/epic-2014/#comment-311</link>
		<pubDate>Fri, 04 Feb 2005 03:20:20 +0000</pubDate>
		<guid>http://www.openindex.org/2004/12/05/epic-2014/#comment-311</guid>
					<description>...like with the new patriot act passed that allows the government to perceive all private information legally.

though the point is simple i think it's a huge step.  the media in this country is a major player in mind control.  more than anything else i think.</description>
		<content:encoded><![CDATA[<p>&#8230;like with the new patriot act passed that allows the government to perceive all private information legally.</p>
<p>though the point is simple i think it&#8217;s a huge step.  the media in this country is a major player in mind control.  more than anything else i think.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Musicplasma : The Music Visual Search Engine by musicplasma</title>
		<link>http://www.openindex.org/2004/08/04/musicplasma-the-music-visual-search-engine/#comment-30</link>
		<pubDate>Mon, 04 Oct 2004 15:05:37 +0000</pubDate>
		<guid>http://www.openindex.org/2004/08/04/musicplasma-the-music-visual-search-engine/#comment-30</guid>
					<description>musicPlasma is really not accurate with classical music... but really better with popular music</description>
		<content:encoded><![CDATA[<p>musicPlasma is really not accurate with classical music&#8230; but really better with popular music
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Tim Berners-Lee&#8217;s semantic web by Tom Booth</title>
		<link>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-11</link>
		<pubDate>Fri, 13 Aug 2004 16:43:20 +0000</pubDate>
		<guid>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-11</guid>
					<description>Apparently the script for this Blog strips HTML tags or anything in angle brackets.

So, trying another method, the examples in my last post should have been something like:

&amp;#60;meta name=&quot;DDS&quot; value=&quot;539&quot;&amp;#62;

or

&amp;#60;xml&amp;#62;
  &amp;#60;DDS&amp;#62;539&amp;#60;/DDS&amp;#62;
&amp;#60;/xml&amp;#62;</description>
		<content:encoded><![CDATA[<p>Apparently the script for this Blog strips HTML tags or anything in angle brackets.</p>
<p>So, trying another method, the examples in my last post should have been something like:</p>
<p>&#60;meta name=&#8221;DDS&#8221; value=&#8221;539&#8243;&#62;</p>
<p>or</p>
<p>&#60;xml&#62;<br />
  &#60;DDS&#62;539&#60;/DDS&#62;<br />
&#60;/xml&#62;
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Tim Berners-Lee&#8217;s semantic web by Tom Booth</title>
		<link>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-10</link>
		<pubDate>Tue, 10 Aug 2004 17:06:39 +0000</pubDate>
		<guid>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-10</guid>
					<description>The idea of the &quot;Semantic Web&quot; is certainly way &quot;cool&quot;. But I'm not sure that AI will ever be capable, (any time in the near future anyway) of embeding &quot;a machine-readable explanation or description of the web into the web itself.&quot; without human direction.

That is, the information being &quot;embeded&quot; would, most probably, in my opinion, be in the form of metadata resulting from human evaluation of the content of a resource and humans somehow encodding that evaluation in a &quot;machine readable&quot; form.

Machines don't actually &quot;read&quot; and understand anything. They are only capable of matching character strings. It is up to humans to program the computers and tell them what to do once they find a match.

If &quot;metadata&quot; CONCEPTS were strictly formatted and standardized then computers would be able to recognize &quot;concepts&quot; if those concepts were represented in the form of unique character strings or codes (so they wouldn't be confused with ordinary language or be otherwise ambiguous).

The computer could then be programmed to do things that would appear to be quite intelligent but would actually just be following a program based upon matching a character string.

For this to happen, people would need access to some sort of standard &quot;language&quot; or set of codes that represented &quot;concepts&quot; and these codes would have to be reasonably understandable to humans before there would be any major movement towards adding the appropriate metadata (concept codes) to their websites.

In other words, using the Dewey Decimal System &quot;codes&quot; as an example.

If people simply began adding something like:

&lt;meta name=DDS content=530.1&gt;

or

&lt;img src=&quot;picture.jpg&quot; meta=DDS:530.1&gt;

(Though this later example is not within standard HTML specs. that I know of)

A computer could then be programmed to recognize this as relating to the concept of &quot;Quantum Mechanics&quot; within the concept of &quot;Physics&quot;-530 within the concept of &quot;Science&quot;-500 even if the resource did not make mention of any of these &quot;KEY WORDS&quot;, or any words at all, The resource might infact be a picture of a particle interaction within a bubble chamber as part of a Physics experiment with no words at all, yet the computer would &quot;know&quot; or recognize from the metadata that the resource was related to Quantum Physics.

Of course, in this example, It might be that one person used DDS for metadata while someone else used Library of Congress codes, while someone else used subject terms (Key words), while someone else used another system of categorization altogether... So in order for something like that to work the &quot;Metadata&quot; codes must be strictly defined and standardized.

And, of course, using DDS is probably out due to possible copyright issues. The LC system is not comprehensive enough to categorize internet resources. So probably some entirely new set of metadata codes would need to be standardized.

XML goes some distance in this direction but I'm not sure the potential XML &quot;codes&quot; are  sufficiently standardized nor is XML easily understandable for the ordinary web master to implement.

In other words, whatever system of encodding data is used for this &quot;semantic web&quot;, ideally, it should be both computer readable as well as reasonably understandable to humans who must do the actual conceptualization and encodding so as to make the &quot;concepts&quot; computer readable.&lt;/meta&gt;</description>
		<content:encoded><![CDATA[<p>The idea of the &#8220;Semantic Web&#8221; is certainly way &#8220;cool&#8221;. But I&#8217;m not sure that AI will ever be capable, (any time in the near future anyway) of embeding &#8220;a machine-readable explanation or description of the web into the web itself.&#8221; without human direction.</p>
<p>That is, the information being &#8220;embeded&#8221; would, most probably, in my opinion, be in the form of metadata resulting from human evaluation of the content of a resource and humans somehow encodding that evaluation in a &#8220;machine readable&#8221; form.</p>
<p>Machines don&#8217;t actually &#8220;read&#8221; and understand anything. They are only capable of matching character strings. It is up to humans to program the computers and tell them what to do once they find a match.</p>
<p>If &#8220;metadata&#8221; CONCEPTS were strictly formatted and standardized then computers would be able to recognize &#8220;concepts&#8221; if those concepts were represented in the form of unique character strings or codes (so they wouldn&#8217;t be confused with ordinary language or be otherwise ambiguous).</p>
<p>The computer could then be programmed to do things that would appear to be quite intelligent but would actually just be following a program based upon matching a character string.</p>
<p>For this to happen, people would need access to some sort of standard &#8220;language&#8221; or set of codes that represented &#8220;concepts&#8221; and these codes would have to be reasonably understandable to humans before there would be any major movement towards adding the appropriate metadata (concept codes) to their websites.</p>
<p>In other words, using the Dewey Decimal System &#8220;codes&#8221; as an example.</p>
<p>If people simply began adding something like:</p>
<p><meta name=DDS content=530.1></p>
<p>or</p>
<p><img src="picture.jpg" meta=DDS:530.1></p>
<p>(Though this later example is not within standard HTML specs. that I know of)</p>
<p>A computer could then be programmed to recognize this as relating to the concept of &#8220;Quantum Mechanics&#8221; within the concept of &#8220;Physics&#8221;-530 within the concept of &#8220;Science&#8221;-500 even if the resource did not make mention of any of these &#8220;KEY WORDS&#8221;, or any words at all, The resource might infact be a picture of a particle interaction within a bubble chamber as part of a Physics experiment with no words at all, yet the computer would &#8220;know&#8221; or recognize from the metadata that the resource was related to Quantum Physics.</p>
<p>Of course, in this example, It might be that one person used DDS for metadata while someone else used Library of Congress codes, while someone else used subject terms (Key words), while someone else used another system of categorization altogether&#8230; So in order for something like that to work the &#8220;Metadata&#8221; codes must be strictly defined and standardized.</p>
<p>And, of course, using DDS is probably out due to possible copyright issues. The LC system is not comprehensive enough to categorize internet resources. So probably some entirely new set of metadata codes would need to be standardized.</p>
<p>XML goes some distance in this direction but I&#8217;m not sure the potential XML &#8220;codes&#8221; are  sufficiently standardized nor is XML easily understandable for the ordinary web master to implement.</p>
<p>In other words, whatever system of encodding data is used for this &#8220;semantic web&#8221;, ideally, it should be both computer readable as well as reasonably understandable to humans who must do the actual conceptualization and encodding so as to make the &#8220;concepts&#8221; computer readable.</meta>
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on The Visual Thesaurus, a Dictionary of the English Language by Tom Booth</title>
		<link>http://www.openindex.org/2004/08/04/the-visual-thesaurus-a-dictionary-of-the-english-language/#comment-9</link>
		<pubDate>Tue, 10 Aug 2004 15:58:15 +0000</pubDate>
		<guid>http://www.openindex.org/2004/08/04/the-visual-thesaurus-a-dictionary-of-the-english-language/#comment-9</guid>
					<description>Now that was cool.

There was a thread in &quot;The Well&quot; some time back where it was stated that this sort of &quot;VISUALIZATION&quot; would be the future in terms of returning search engine results.

Imagine the same sort of functionality, but instead of a simple Thesaurus or Dictionary the &quot;visualization&quot; of interconnected ideas would actually end up leading to web sites within the chosen categories rather than just definitions of terms.

A kind of visual web of the web itself. A means of seeing how web sites are interrelated rather than just words.</description>
		<content:encoded><![CDATA[<p>Now that was cool.</p>
<p>There was a thread in &#8220;The Well&#8221; some time back where it was stated that this sort of &#8220;VISUALIZATION&#8221; would be the future in terms of returning search engine results.</p>
<p>Imagine the same sort of functionality, but instead of a simple Thesaurus or Dictionary the &#8220;visualization&#8221; of interconnected ideas would actually end up leading to web sites within the chosen categories rather than just definitions of terms.</p>
<p>A kind of visual web of the web itself. A means of seeing how web sites are interrelated rather than just words.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Open WorldCat pilot [OCLC] by Paul Wilford</title>
		<link>http://www.openindex.org/2004/07/06/open-worldcat-pilot-oclc/#comment-8</link>
		<pubDate>Thu, 05 Aug 2004 04:01:34 +0000</pubDate>
		<guid>http://www.openindex.org/2004/07/06/open-worldcat-pilot-oclc/#comment-8</guid>
					<description>The OCLC is a non-profit cooperative of member libraries, so I assume fees (if any) are due to expenses which are distributed among the members libraries - not the users.

There is a fee (according to the referenced State University of New York web site) for searching via the FirstSearch research service.  This is a service which accesses WorldCat among other databases (such as Medline and ERIC).  FirstSearch isn't WorldCat: it's a research service which access WorldCat.

The search engines have been carrying a subset of WorldCat data as a pilot project.  OCLC hopes to make search engine access of WorldCat a permanent part of their service to member libraries.</description>
		<content:encoded><![CDATA[<p>The OCLC is a non-profit cooperative of member libraries, so I assume fees (if any) are due to expenses which are distributed among the members libraries - not the users.</p>
<p>There is a fee (according to the referenced State University of New York web site) for searching via the FirstSearch research service.  This is a service which accesses WorldCat among other databases (such as Medline and ERIC).  FirstSearch isn&#8217;t WorldCat: it&#8217;s a research service which access WorldCat.</p>
<p>The search engines have been carrying a subset of WorldCat data as a pilot project.  OCLC hopes to make search engine access of WorldCat a permanent part of their service to member libraries.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Tim Berners-Lee&#8217;s semantic web by Paul Wilford</title>
		<link>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-7</link>
		<pubDate>Wed, 04 Aug 2004 08:16:06 +0000</pubDate>
		<guid>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-7</guid>
					<description>I think there is a 'coolness' factor that is not being sufficiently addressed.  The semantic web (Internet) is meant to be understood by machines, and that's cool because....

...This is a route to AI on the web. It embeds a machine-readable explanation or description of the web into the web itself.

It isn't meant for humans to use directly. It needs applications which make use of it.  Maybe some of those will reach the average user and his web page.

The idea of what those applications might be (or become) excites the imagination, and accounts for the coolness factor.  That doesn't seem too be mentioned much, and feels disingenuous.</description>
		<content:encoded><![CDATA[<p>I think there is a &#8216;coolness&#8217; factor that is not being sufficiently addressed.  The semantic web (Internet) is meant to be understood by machines, and that&#8217;s cool because&#8230;.</p>
<p>&#8230;This is a route to AI on the web. It embeds a machine-readable explanation or description of the web into the web itself.</p>
<p>It isn&#8217;t meant for humans to use directly. It needs applications which make use of it.  Maybe some of those will reach the average user and his web page.</p>
<p>The idea of what those applications might be (or become) excites the imagination, and accounts for the coolness factor.  That doesn&#8217;t seem too be mentioned much, and feels disingenuous.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on LLRX &#8212; Coming Soon &#65533; the Death of Search Engines? by Tom Booth</title>
		<link>http://www.openindex.org/2004/07/06/llrx-coming-soon-the-death-of-search-engines/#comment-6</link>
		<pubDate>Fri, 09 Jul 2004 17:52:07 +0000</pubDate>
		<guid>http://www.openindex.org/2004/07/06/llrx-coming-soon-the-death-of-search-engines/#comment-6</guid>
					<description>The concluding remarks: &quot;...the Internet ...is also about to cement its position as yet another mass medium &quot; as &quot;free&quot; search engines give way to paid content providers I think is false and seems to imply that search engine results won't or can't get any better in the future.

I believe the solution to the information overload and irrelevant search results described in the article is not to change or improve search engine capabilities per-se but to improve metadata indexing on the part of web-masters and internet content providers so that the search engines have something relavant to look for in terms of the value and importance of information.

Some of the most important meta-data that should be incorporated into a web-site as well as LINKS to a web-site are:

Subject (of course), Location (if relevant such as for a &quot;brick and mortar&quot; business or an scheduled event in the real world) original date posted, source, target audiance, exspiration (When information looses its value, for example: a page about an event becomes irrelevant for most purposes the day after the event.), Some indication of the web-page's importance for the field of study to which the page relates might be indicated. (for example a &quot;Links&quot; page or any link anywhere could include metadata that indicates the relative importance of each page or link.)

Information (links) could also be ranked according to the credibility of the source (in the opinion of the site providing the link.

Search engines could give an information source some priority if several other sites give a high ranking to the information in terms of importance, credibility, urgency(time, exspiration etc.)

The regular incorporation of such metadata on web sites and links to web sites would provide more criteria for search engine selection of information rather than relying on &quot;key words&quot; alone.

Unfortunately the current structure for including &quot;matadata&quot; on a web page is not standardized. There is, as yet, no agreed-upon method or standard for including such information, therefore the search engines don't &quot;know&quot; what to look for.</description>
		<content:encoded><![CDATA[<p>The concluding remarks: &#8220;&#8230;the Internet &#8230;is also about to cement its position as yet another mass medium &#8221; as &#8220;free&#8221; search engines give way to paid content providers I think is false and seems to imply that search engine results won&#8217;t or can&#8217;t get any better in the future.</p>
<p>I believe the solution to the information overload and irrelevant search results described in the article is not to change or improve search engine capabilities per-se but to improve metadata indexing on the part of web-masters and internet content providers so that the search engines have something relavant to look for in terms of the value and importance of information.</p>
<p>Some of the most important meta-data that should be incorporated into a web-site as well as LINKS to a web-site are:</p>
<p>Subject (of course), Location (if relevant such as for a &#8220;brick and mortar&#8221; business or an scheduled event in the real world) original date posted, source, target audiance, exspiration (When information looses its value, for example: a page about an event becomes irrelevant for most purposes the day after the event.), Some indication of the web-page&#8217;s importance for the field of study to which the page relates might be indicated. (for example a &#8220;Links&#8221; page or any link anywhere could include metadata that indicates the relative importance of each page or link.)</p>
<p>Information (links) could also be ranked according to the credibility of the source (in the opinion of the site providing the link.</p>
<p>Search engines could give an information source some priority if several other sites give a high ranking to the information in terms of importance, credibility, urgency(time, exspiration etc.)</p>
<p>The regular incorporation of such metadata on web sites and links to web sites would provide more criteria for search engine selection of information rather than relying on &#8220;key words&#8221; alone.</p>
<p>Unfortunately the current structure for including &#8220;matadata&#8221; on a web page is not standardized. There is, as yet, no agreed-upon method or standard for including such information, therefore the search engines don&#8217;t &#8220;know&#8221; what to look for.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Open WorldCat pilot [OCLC] by Tom Booth</title>
		<link>http://www.openindex.org/2004/07/06/open-worldcat-pilot-oclc/#comment-5</link>
		<pubDate>Fri, 09 Jul 2004 16:50:41 +0000</pubDate>
		<guid>http://www.openindex.org/2004/07/06/open-worldcat-pilot-oclc/#comment-5</guid>
					<description>According to information on this site: http://www.oclc.org/worldcat/pilot/facts/default.htm there will be a transition to some form of &quot;membership&quot; service.: &quot;Next steps
Transition Open WorldCat from pilot service to permanent membership...&quot;

No mention is made in regard to membership fees if any. Though I believe library membership in OCLC is already a requirement: &quot;To participate in the Open WorldCat pilot, a library must have contributed records and/or ownership (holdings) information to WorldCat.&quot;

Will there be an additional fee on top of the current OCLC library membership fee?

One site mentions: &quot;When you want to do a search, you access the databases through OCLC's fee for service (means not free!) system&quot; - http://www.oneonta.edu/library/critical/INFOLIT/course/libsys/lim6p1.htm</description>
		<content:encoded><![CDATA[<p>According to information on this site: <a href='http://www.oclc.org/worldcat/pilot/facts/default.htm' rel='nofollow'>http://www.oclc.org/worldcat/pilot/facts/default.htm</a> there will be a transition to some form of &#8220;membership&#8221; service.: &#8220;Next steps<br />
Transition Open WorldCat from pilot service to permanent membership&#8230;&#8221;</p>
<p>No mention is made in regard to membership fees if any. Though I believe library membership in OCLC is already a requirement: &#8220;To participate in the Open WorldCat pilot, a library must have contributed records and/or ownership (holdings) information to WorldCat.&#8221;</p>
<p>Will there be an additional fee on top of the current OCLC library membership fee?</p>
<p>One site mentions: &#8220;When you want to do a search, you access the databases through OCLC&#8217;s fee for service (means not free!) system&#8221; - <a href='http://www.oneonta.edu/library/critical/INFOLIT/course/libsys/lim6p1.htm' rel='nofollow'>http://www.oneonta.edu/library/critical/INFOLIT/course/libsys/lim6p1.htm</a>
</p>
]]></content:encoded>
				</item>
	<item>
		<title>Comment on Tim Berners-Lee&#8217;s semantic web by Tom Booth</title>
		<link>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-4</link>
		<pubDate>Tue, 08 Jun 2004 18:19:21 +0000</pubDate>
		<guid>http://www.openindex.org/2004/06/08/tim-berners-lees-semantic-web/#comment-4</guid>
					<description>The concept is good but the structure and implementation is completely unrealistic in my opinion.

XML:Extensible Markup Language, OWL: Ontology Web Language, RDF: Resource Descriptor Framework, URI: Uniform Resource Identifier and DAML: Darpa Agency Mark-up Language. And your average Joe posting a web page is supposed to learn and understand all this and add it to his web site? The articles state that Web Publishers are supposed to add these coded tags and what not to their own websites for this to work.

It (XML) has some usefulnes and is already being used by some giant web-portals that can afford a full time IT staff to work on it and keep it going but I'm afraid that 99.99% of Web Masters will never comprehend it or use it to contribute their page to the &quot;semantic web&quot; in its current proposed form using &quot;standard&quot;(?) XML.

It would have to be greatly simplified and standardised so that just about anyone could learn to understand it and use it in no more time than it takes to learn basic HTML. I would hope much less time than that.

As it is, I've been reading about and studying XML off and on for years and I still can't quite grasp how someone is supposed to use it on their web page in the manner proposed by this &quot;semantic web&quot; idea.</description>
		<content:encoded><![CDATA[<p>The concept is good but the structure and implementation is completely unrealistic in my opinion.</p>
<p>XML:Extensible Markup Language, OWL: Ontology Web Language, RDF: Resource Descriptor Framework, URI: Uniform Resource Identifier and DAML: Darpa Agency Mark-up Language. And your average Joe posting a web page is supposed to learn and understand all this and add it to his web site? The articles state that Web Publishers are supposed to add these coded tags and what not to their own websites for this to work.</p>
<p>It (XML) has some usefulnes and is already being used by some giant web-portals that can afford a full time IT staff to work on it and keep it going but I&#8217;m afraid that 99.99% of Web Masters will never comprehend it or use it to contribute their page to the &#8220;semantic web&#8221; in its current proposed form using &#8220;standard&#8221;(?) XML.</p>
<p>It would have to be greatly simplified and standardised so that just about anyone could learn to understand it and use it in no more time than it takes to learn basic HTML. I would hope much less time than that.</p>
<p>As it is, I&#8217;ve been reading about and studying XML off and on for years and I still can&#8217;t quite grasp how someone is supposed to use it on their web page in the manner proposed by this &#8220;semantic web&#8221; idea.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
