Tuesday, November 24, 2009

Finally, a National Semantic Technology Roadmap!

Probably, you might have assumed it that I am talking about the US but actually it is envisioned by the Malaysian government on their National Information technology Council (NITC) web site. More details would have helped but I will still give lot of credit to Malaysian government to start thinking in this direction and having the courage to recognize Semantic Technology on their official technology web site under the Ministry of Science, Technology and Innovation. Among other great things, the Malaysian governement also has the credit of creating one of the tallest towers in the world called Petrona Twin Towers. You have to be there to believe it and these towers have also become the epitome of astonishing growth in the last two decades in Malaysia.


While in US, sometimes, we still have to run sessions in conferences like - "Is Semantic Technology for Real? "

Friday, November 20, 2009

The Death of Taxonomies! Is there a role for Semantic Technologies?

Congressional Tweet Word Cloud (research in pr...Image by justgrimes via Flickr
There is a very nice article by CMSwatch about how the practice of building taxonomies is dying - this discussion is more oriented towards content/document-centric world. It basically highlights few trends:
  • Since social computing is being widely adapted by companies so there is no fixed way of categorizing things i.e days of single, heirarchical taxonomies are far behind. People will tag content in any way they want.
  • You should be able to get to information the way you want, which may be different from your colleague's approach.
  • Text mining and auto-tagging software is gradually improving, and extracted terms can be applied as metadata. Metadata needs to be very fluid - cloud like. See the fig. above
  • Metadata architects should really understand the domain
I believe there are some solid arguments as I have seen this trend myself. I have only few things to add:
  • What will be the future of existing large taxonomies?
  • Many companies have already invested in more than one content management system like Sharepoint, Documentum or Opentext. There is a good opportunity for semantic web technologies, basically OWL and RDF, which can unify the taxonomies of different content management systems and provide a single data model to retrieve the content through.
  • We should always be very clear about what the word "Metadata" means in any context. It means different things to different people and has been overused. A metadata in a document-centric world is entirely different from say a metadata in a word processing program.
  • Semantic Web is really an umbrella language for metadata. More thought needs to be given to how this new trend of adding tags/metadata on the fly can be leveraged to add more value to the semantic web or linked data cloud.
Reblog this post [with Zemanta]

Monday, November 16, 2009

Information is Beautiful! What about Semantics?

I got this information from a site called Information-is-Beautiful and decided to share it with you. The site is really about ideas, issues, knowledge, data - visualized! This creative picture captures it all and should make everyone think twice when next time they think about designing "anything" which is interesting, functional, has a form and has plenty of integrity! And yes, what about "Semantics?" Probably, this creative author is also taking "Semantics for granted," at least in this picture, like tens of thousands of businesses worldwide!




Thursday, November 12, 2009

Semantics, Metal Reserves and how long will they last!

The picture below tells you about the state of planet earth's metal reserves!



Basically, this diagram says that there are fifty eight years left when all the metal resources will be exhausted  if the world continues to consume at today's rate. There is nothing to worry about because I am sure that geologists on this planet will figure something out much before that.  I am saying this because many of us worried about the state of oil reserves also in last two decades after many pessimistic predictions - more than half of the time people were "Crying Wolf." In any case, the point I want to talk more about is the role of semantic technology in helping the geologists around the world.

I came across an interesting paper in this context. The key points explaining why semantics is becoming important in geology are:
  • Geology, perhaps more than any other science, has advanced with the aid of pictures. The pictures, which are geological maps shows the distribution of different rock types. The current efforts to improve them have spawned a global e-science initiative to review the semantics of geology.

  • One example of issue of semantics in context of geology is - The US department of agriculture and Forest service and the British Columbia Ministry of Lands, Parks and Environment will describe and classify the same geological phenomenan in a completely different way

  • When working at relatively large scales, geologists are always dependent on data and information gathered and reported by other geologists regarding smaller scale features. Semantics is very critical here also

  • One of the consistent problems in geology is the omni-present "map boundary fault" - an apparent geological continuity along the border of adjoining maps which is nothing more than difference in nomenclature or semantics

  • Research is being done to prototype vocabularies by developing web services which can be used to store, compare, and rank on similarity, decsriptions of models (concepts) and instances (physical entitites and events) of mineral deposits, landslides and landslide hazards.
To go through the full research papaer please go here.
Reblog this post [with Zemanta]

Semantic Advertising or not : Diamonds are forever!

Most of us know the origins of the  "De Beers", one of the largest diamond company out of South Africa, and how its founder got his start by renting water pumps to miners during the diamond rush. It is also believed that the founders  merged their interests in various diamond mines into a single entity that would be powerful enough to control production and perpetuate the illusion of scarcity of diamonds. Rest is history! One thing at least I didn't know was how De Beers is used as an example, as shown below, of  "Bad" ad placements. Basically, it highlights the limitation of existing ad technologies like contextual advertising and behavioral targetting in understanding the meaning of a web page.





Semantic Advertising or Semantic Targeting, which is more about targeting the relevant ad based on the meaning and sentiment of the page has gained lot of traction in last few years. I will recommend everyone to read this article by Scott Brinker which explains four different kinds of Semantic advertising in a very clear way.

In theory, the value of semantic advertising is great and companies like peer39, expert system and few others have gained lots of visibility. The opportunity is large as publishing industry is going through the biggest turmoil in decades because of this new generation who doesn't believe in paying for the content. So a relevant or highly targeted advertisement will end up being a major source of revenue for publishers.

In the end, semantic advertising is still a niche space and not a whole lot is understood by the industry about the effectiveness of this approach. There are not enough demos to show the value of semantic advertisement and publishers still need to be convinced more about the benefits of switching to semantic advertising. It will probably take some more time for semantic advertising to become completetely mainstream but I don't think that it will have any bearing on future of De Beers despite the "flaw" in ad placement - as there is a saying that "it is better to have a diamond with a flaw then to have a pebble without one." "Diamonds are forever" and they will always remain a scarce and sought after resource at least in next few decades.
Reblog this post [with Zemanta]

Thursday, November 5, 2009

Linked Open Data : Can we learn anything from failure of many B2B Exchanges?

"You can't stop an idea whose time has come!" Linked data seems to be that idea in the grand vision of Semantic Web! The growth of Linked data cloud always reminds you that we are in exponential times! The Semantic technology community, including me, believes that Linked Data is the best thing happened to the Semantic Web vision. It makes sense and it is the next step for web and can also contribute significantly, if done right, to the evolution of this civilization. The possibilities are endless! We also have one of the best brains behind this initiative.  Whenever I hear Metcalfe's law (by Bob Metcalfe) , in context of Linked data, which states that "the value of the network is proportional to the square of the connected users of the system" then it always reminds me of those early B2B days during late 90s when the world was about to change. Metcalfe law was very popular during those times also! B2B exchanges were considered the pillars of the new economy and their valuations made us lose our sleep. It seemed to all of us that everything was going to be re-defined and we wanted to be part of it. Well, it didn't happen exactly as we were led to believe! Despite the differences between B2B and Linkedata like one is trasactional and other is knowledge oriented, privately owned vs free, different technologies, different era etc., there are some similarities in both of them. Lets do some introspection so that we don't repeat some of those mistakes!

B2B (Business to Business) exchanges is an entity which brings multiple buyers and sellers to a  marketplace where all kinds of commodity, financial instruments, intellectual property and various other goods can be eletronically traded - web is used as the medium in most of the cases. It has following characterstics:
  • The perceived value followed Metcalfe's law
  • Enterprenuers could fundamentally re-invent how work gets done
  • No longer comparison between big-small and so on
  • Shift of power from producers to consumers who are in control of everything
  • Standardized marketplace and standardized contracts
  • Markets operated at fraction of physical word cost
  • Global reach and one stop shopping
  • Neutrality, transparency, self-regulation, market efficiency, confidentiality and anonymity were other virtues of this marketplace
  • The winner, of a particluar vetical B2B , takes all 
It did have some good principles but a very large number of the B2B exchanges failed within few years. There were few fundamental reasons for their failure:

  • The internet enterpreneurs didn't really understand their place in the overall marketplace
  • What they offered to business was something that already existed - at least in some form
  • Companies had always done business with other businesses and most of these businesses negotiate to lower the price. Adapting those existing processes to an internet format didn't really create anything new or different in the field. Unfortunately, many companies felt that switching to an internet-based sites controlled by third parties was a riskier bet than staying with the current partners and vendors
Few of these exchanges figured it out early and survived by either addressing these issues or by being creative about it - like some of them just focused on small businesses. ECN (Electronic Communication Network) are another success story in the world of B2B exchanges as they allowed a more efficient price discovery mechanism for stocks and currencies.  In the end, B2B exchanges were all about attracting buyers and sellers or producers and consumers which they couldn't do. In a similar way, the success of Linked data cloud will depend on creating a marketplace which should be able to attract producers and consumers or buyers or sellers. The technical design prinicples by Sir Tim Berners Lee are just great and well thought of. We are also aware of the existing issues in the Linked data Cloud like quality of data, what is out there, disambiguations issues, trust of the source, frequency of the update, how to get started, should we always publish in RDF, how do you erase inaccuracy and many others things like this. We had very similar issues, maybe in a different flavor, when Web 1.0 was developing and we have come a long way. We might say that HTML was much easier to adapt then RDF but the success or wide adaption of the original web was not only simplicity of HTML and HTTP protocol but because it was a great sales, marketing and ecommerce tool. In context of Linked data cloud, lets accept the fact that most of the businesses will not be  interested in "the future," they will always be interested in "their future." Their first responsibility will always be their share holders.

Companies need very strong incentives or business case to publish their data to this "Cloud" and write applications to consume the collective intelligence of the nodes in this cloud. Initiatives like exposing government data (UK and US at this point), dbpedia, scientific, map oriented data, music, individual research projects and so many other examples which can be good and interesting from exploration standpoint are already underway. We will also see more invovlement from non-profits and intelligence communities at some point. It is a great effort but still not enough! The Cloud will continue to grow and will add billions of more RDF triples but we need to proactively involve the corporate world. Technology enterpreneurs for Semantic Web or Linked data can't do it alone. They have to partner with the business community and the enterprise. The value proposition of Linked data needs to be articulated to the CXO community and their participation needs to be encouraged. We have to start having more conversations like the "benefits of publishing data to the cloud for the enterprise." Basically, much more on business development, business case analysis, education, marketing then technology alone - technology will always remain important and core. I had written an earlier blog about a new category name for Semantic technology which talks about some aspects. Once we have their attention and buy in then the ecosystem of new tools, applications, security infrastructure, new architecture, developers, APIs, SEO, creative sales and marketing ideas, consulting, outsourcing, selling and buying and many either good things will start maturing. It will be very important to do so in next few years to sustain this momentum of Linked data cloud. It will also attract significant investments in this effort! Don't we think that there are dollars to be spent on the Linked data cloud in the  worldwide IT spending of more than three trillion dollars in 2009. We should be thinking or advocating about the "guidelines" for  publishing to the Linked data Cloud like:

  • The benefits of Linked data and possible use cases.
  • How are they impacting their supply chain? 
  • What are the advantages in reciprocity of links? 
  • What should be the process of identifying the data which needs to be published
  • How will it help their revenue and help in their relationships with partners and customers?
  • What are the security issues? What kind of security tools or approaches which are out there?
  • How will it mitigate risks for the company?
  • What will be the possible compliance and legal issues for them?
  • What data access choices can they have - free access, partial access, sign up , qualification, payment?
  • What are the data packing options like - depth, breath, granularity, freshness etc..
  • Can they have handle on who is accessing this data, frequency of access, redistribution issues?
  • Can they create virtual networks of data clouds for their partners and business customers?
  • Can they promote their data?
  • Can they increase the findability of their data?
  • How can they do it in a phased manner?
  • If they use RDFa then are they part of Linked data?
  • Are they accountable for consistency, organization, correctness of their data?
I can think of so many more questions. A very clear policies and procedures aproach needs to be articulated which will lead to a governance model for publishing to Linked data cloud within an enterprise. Probably, we will have to learn more from the experience of present initiatives like publishing data from the UK government to the cloud. During the early days of web, if any company didn't have a web presence then it was perceived that they were missing the action, opportunity or future revenue. At some point, today's businesses should start looking at Linked data cloud with the same lens.
Reblog this post [with Zemanta]