<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>OnStrategies Perspectives</title>
	<atom:link href="http://www.onstrategies.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.onstrategies.com/blog</link>
	<description>Insights on the world of Information Technology -- Views expressed here do not reflect the opinions of Ovum.</description>
	<lastBuildDate>Tue, 31 Jan 2012 13:40:58 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>EMC’s Hadoop Strategy cuts to the chase</title>
		<link>http://www.onstrategies.com/blog/2012/01/31/emc%e2%80%99s-hadoop-strategy-cuts-to-the-chase/</link>
		<comments>http://www.onstrategies.com/blog/2012/01/31/emc%e2%80%99s-hadoop-strategy-cuts-to-the-chase/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 13:00:35 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Data Management]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=833</guid>
		<description><![CDATA[To date, Big Storage has been locked out of Big Data. It’s been all about direct attached storage for several reasons. First, Advanced SQL players have typically optimized architectures from data structure (using columnar), unique compression algorithms, and liberal usage of caching to juice response over hundreds of terabytes. For the NoSQL side, it’s been [...]]]></description>
			<content:encoded><![CDATA[<p>To date, Big Storage has been locked out of Big Data. It’s been all about direct attached storage for several reasons. First, Advanced SQL players have typically optimized architectures from data structure (using columnar), unique compression algorithms, and liberal usage of caching to juice response over hundreds of terabytes. For the <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a> side, it’s been about cheap, cheap, cheap along the Internet data center model: have lots of commodity stuff and scale it out. <a href="http://hadoop.apache.org/">Hadoop</a> was engineered exactly for such an architecture; rather than speed, it was optimized for sheer linear scale.</p>
<p>Over the past year, most of the major platform players have planted their table stakes with Hadoop. Not surprisingly, IT household names are seeking to somehow tame Hadoop and make it safe for the enterprise.</p>
<p>Up till now, anybody with armies of the best software engineers that Internet firms could buy could brute force their way to scale out humungous clusters and if necessary, invent their own technology, then share and harvest from the open source community at will. Hardly a suitable scenario for the enterprise mainstream, the common thread behind the diverse strategies of <a href="http://www-03.ibm.com/systems/data/flash/smartercomputing/bigdata.html?csr=agus_brspsmartcompv2-20110613&#038;cm=k&#038;cr=google&#038;ct=USBRB301&#038;S_TACT=USBRB301&#038;ck=ibm_big_data&#038;cmp=USBRB&#038;mkwid=seZYq31V5_13248866747_432hpc8503">IBM</a>, <a href="http://greenplum.com/">EMC</a>, <a href="http://www.microsoft.com/sqlserver/en/us/solutions-technologies/business-intelligence/big-data-solution.aspx">Microsoft</a>, and <a href="http://www.oracle.com/us/corporate/press/1453721">Oracle</a> toward Hadoop has been to not surprisingly make Hadoop more approachable.</p>
<p>What’s been conspicuously absent so far was a play from Big Optimized Storage. The conventional wisdom is that <a href="http://en.wikipedia.org/wiki/Storage_area_network">SAN</a> or <a href="http://en.wikipedia.org/wiki/Network-attached_storage">NAS</a> are premium, architected systems whose costs might be prohibitive when you talk petabytes of data. Similarly, so far there has been a different operating philosophy behind the first generation implementations from the NoSQL world that assumed that parts would fail, and that five nines service levels were overkill. And anyway, the design of Hadoop brute forced the solution: replicate to have three unique copies of the data distributed around the cluster, as hardware is cheap.</p>
<p>As Big Data gains traction in the enterprise, some of it will certainly fit this pattern of something being better than nothing, as the result is unique insights that would not otherwise be possible. For instance, if your running analysis of Facebook or Twitter goes down, it probably won’t take the business with it. But as enterprises adopt Hadoop – and as pioneers stretch Hadoop to new operational use cases such as <a href="http://www.hadoopworld.com/session/realtime-big-data-at-facebook-with-hadoop-and-hbase/">what Facebook is doing with its messaging system</a> – those concepts of mission-criticality are being revisited.</p>
<p>And so, ever since <a href="http://emc.com/">EMC</a> announced last spring that its <a href="http://greenplum.com/">Greenplum</a> unit would start supporting and bundling different versions of Hadoop, we’ve been waiting for the other shoe to drop: When would EMC infuse its Big Data play with its core DNA, storage?</p>
<p>Today, EMC announced that its <a href="<a href="http://www.ndm.net/emcstore/isilon/isilon?gclid=CP7x6Mqu-q0CFUSo4Aod-FQNrw">Isilon</a> networked storage system was adding native support for Apache Hadoop’s <a href="http://hadoop.apache.org/hdfs/">HDFS</a> file system. There were some interesting nuances to the rollout.</p>
<p><strong>1. Big vendors are feeling their way around Hadoop</strong><br />
It’s interesting to see how IT household names are cautiously navigating their way into unfamiliar territory. EMC becomes the latest, after <a href="http://www.oracle.com/us/corporate/analystreports/infrastructure/ovum-hadoop-cloudera-1496784.pdf">Oracle</a> and Microsoft, to calibrate their Hadoop strategy in public.</p>
<p><a href="http://www.oracle.com/us/corporate/press/512001">Oracle announced its Big Data appliance last fall</a> <em>before</em> it lined up its Hadoop distribution. <a href="http://www.zdnet.com/blog/microsoft/microsoft-closes-in-on-delivering-its-own-hadoop-competitor/11034?tag=content;siu-container">Microsoft ditched its Dryad project</a> built around its HPC Server. Now EMC has recalibrated its Hadoop strategy; when it first unveiled its Hadoop strategy last spring, the spotlight was on the <a href="http://mapr.com/">MapR</a> proprietary alternatives to the HDFS file system of Apache Hadoop. It’s interesting that vendor initial announcements have either been vague, or have been tweaked as they’ve waded into the market. For EMC’s shift, more about that below.</p>
<p><strong>2. What is Hadoop? For EMC, HDFS is the mainstream, not MapR</strong></p>
<p>MapR’s strategy (and IBM’s along with it, regarding <a href="http://www-03.ibm.com/systems/software/gpfs/">GPFS</a>) has prompted debate and concern in the Hadoop community about commercial vendors forking the technology. <a href="http://www.onstrategies.com/blog/2011/11/11/what-will-hadoop-be-when-it-grows-up/">As we’ve ranted previously</a>, Hadoop’s growth will be tied, not only to megaplatform vendors that support it, but the third party tools and solutions ecosystem that grows around it. For such a thing to happen, ISVs and consulting firms need to have a common target to write against, and having forked versions of Hadoop won&#8217;t exactly grow large partner communities.</p>
<p>Regarding EMC, the original strategy was two Greenplum Hadoop editions: a Community Edition with a free Apache distro and an Enterprise Edition that bundled MapR, both under the Greenplum HD branding umbrella. At first blush, it looked like EMC was going to earn the bulk of its money from the proprietary side of the Hadoop business. What’s significant is that the new announcement of Isilon support pertains on to the HDFS open source side. More to the point, EMC is rebranding and subtly repositioning its Greenplum Hadoop offerings:  Greenplum HD is the Apache HDFS edition with the optional Isilon support, and Greenplum MR is the MapR version, which is niche targeted towards advanced Hadoop use cases that demand higher performance. </p>
<p>Coming atop recent announcements from Oracle and Microsoft that have come clearly out on the side of OEM’ing Apache rather than anything limited or proprietary, and this amounts to an unqualified endorsement of Apache Hadoop/HDFS as not only the formal, but also the de facto standard. This reflects emerging conventional wisdom that the enterprise mainstream is leery about lock-in to anything that smells proprietary for technology where they still are in the learning curve. Other forks may emerge, but they will not be at the base file system layer. This leaves IBM and MapR pigeonholed – admittedly, there will be API compatibility, but clearly both are swimming upstream.</p>
<p><strong>3. Central Storage is newest battleground for Scale Up vs. Scale Out Hadoop</strong></p>
<p>As noted earlier, Hadoop’s heritage has been the classic Internet data center scale-out model. The advantage is that, leveraging Hadoop’s highly linear scalability, organizations could easily expand their clusters quite easily by plucking more commodity server and disk. Pioneers or purists would scoff at the notion of an appliance approach because it was always simply scaling out inexpensive, commodity hardware, rather than paying premiums for big vendor boxes.</p>
<p>In blunt terms, the choice is whether you pay now or pay later. As mentioned before, do-it-yourself compute clusters require sweat equity – you need engineers who know how to design, deploy, and operate them. The flipside is that many, arguably most corporate IT organizations either lack the skills or the capital. There are various solutions to what might otherwise appear a <a href="http://en.wikipedia.org/wiki/Hobson's_choice">Hobson’s Choice</a>:<br />
•	Go to a cloud service provider that has already created the infrastructure, such as what Microsoft is offering with its <a href="http://www.zdnet.com/blog/microsoft/understanding-microsofts-big-picture-plans-for-hadoop-and-project-isotope/11466">Hadoop-on-Azure</a> services;<br />
•	Look for a happy, simpler medium such as <a href="http://aws.amazon.com/elasticmapreduce/">Amazon’s Elastic MapReduce</a> on its <a href="http://aws.amazon.com/dynamodb/">DynamoDB</a> service;<br />
•	Subscribe to SaaS providers that offer Hadoop applications (e.g., social network analysis, smart grid as a service) as a service;<br />
•	Get a platform and have a systems integrator put it together for you (key to <a href="http://www-01.ibm.com/software/data/infosphere/biginsights/">IBM&#8217;s BigInsights</a> offering, and applicable to any SI that has a Hadoop practice)<br />
•	Go to an appliance or engineered systems approach that puts Hadoop and/or its subsystems in a box, such as with <a href="http://www.oracle.com/us/corporate/features/feature-obda-498724.html">Oracle Big Data Appliance</a> or EMC’s <a href="http://www.greenplum.com/products/greenplum-dca">Greenplum DCA</a>. The systems engineering is mostly done for you, but the increments for growing the system can be much larger than simply adding a few x86 servers here or there (Greenplum HD DCA can scale in groups of 4 server modules). Entry or expansion costs are not necessarily cheap, but then again, you have to balance capital cost against labor.<br />
•	Surrounding Hadoop infrastructure with solutions. This is not a mutually exclusive strategy; unless you’re <a href="http://cloudera.com">Cloudera</a> or <a href="http://hortonworks.com">Hortonworks</a>, which make their business bundling and supporting the core Apache Hadoop platform, most of the household names will bundle frameworks, algorithms, and eventually solutions that in effect place Hadoop under the hood. For EMC, the strategy is their recent announcement of a <a href="http://www.greenplum.com/products/greenplum-uap">Unified Analytics Platform (UAP)</a> that provides <a href="http://www.greenplum.com/products/chorus">collaborative development capabilities for Big Data applications</a>. EMC is (or will be) hardly alone here.</p>
<p>With EMC’s new offering, the scale-up option tackles the next variable: storage. This is the natural progression of a market that will address many constituencies, and where there will be no single silver bullet that applies to all.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2012/01/31/emc%e2%80%99s-hadoop-strategy-cuts-to-the-chase/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Oracle fills another gap in its Big Data offering</title>
		<link>http://www.onstrategies.com/blog/2012/01/10/oracle-fills-another-gap-in-its-big-data-offering/</link>
		<comments>http://www.onstrategies.com/blog/2012/01/10/oracle-fills-another-gap-in-its-big-data-offering/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 19:35:12 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Database]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=811</guid>
		<description><![CDATA[When we last left Oracle’s Big Data plans, there was definitely a missing piece. Oracle’s Big Data Appliance as initially disclosed at last fall’s OpenWorld was a vague plan that appeared to be positioned primarily as an appliance that would accompany and feed data to Exadata. Oracle did specify some utilities, such as an enterprise [...]]]></description>
			<content:encoded><![CDATA[<p>When we last left <a href="http://oracle.com/">Oracle’s</a> Big Data plans, there was definitely a missing piece. <a href="http://www.oracle.com/us/corporate/features/feature-obda-498724.html">Oracle’s Big Data Appliance</a> as initially disclosed at last fall’s OpenWorld was a vague plan that appeared to be positioned primarily as an appliance that would accompany and feed data to <a href="http://www.oracle.com/us/products/database/exadata-database-machine/overview/index.html">Exadata</a>. Oracle did specify some utilities, such as an enterprise version of the open source <a href="http://www.r-project.org/">R</a> statistical processing program that was designed for multithreaded execution, plus a distribution of a <a href="http://en.wikipedia.org/wiki/NoSQL">NoSQL</a> database based on Oracle’s <a href="http://www.oracle.com/technetwork/database/berkeleydb/overview/index.html">BerkeleyDB</a> as an alternative to Apache <a href="http://hive.apache.org/">Hive</a>. But the emphasis appeared to be extraction and transformation of data for Exadata via Oracle’s own utilities that were optimized for its platform.</p>
<p>As such, Oracle’s plan for <a href="http://hadoop.apache.org/">Hadoop</a> was competition, not for <a href="http://cloudera.com/">Cloudera</a> (or <a href="http://hortonworks.com">Hortonworks</a>), which featured a full Apache Hadoop platform, but EMC which offered a comparable, appliance-based strategy that pairs Hadoop with an Advanced SQL data store; and IBM, which took a different approach by emphasizing Hadoop as an analytics platform destination enhanced with text and predictive analytics engines, and other features such as unique query languages and file systems.</p>
<p>Oracle’s initial Hadoop blueprint lacked explicit support of many pieces of the Hadoop stack such as <a href="http://hbase.apache.org/">HBase</a>, Hive, <a href="http://pig.apache.org/">Pig</a>, <a href="http://zookeeper.apache.org/">Zookeeper</a>, and <a href="http://avro.apache.org/">Avro</a>. No more. With Oracle’s announcement of general availability of the Big Data appliance, it is filling in the blanks by disclosing that it is OEM’ing Cloudera’s CDH Hadoop distribution, and more importantly, the management tooling that is key to its revenue stream. For Oracle, OEM’ing Cloudera’s Hadoop offering fully fleshes out its Hadoop distribution and positions it as a full-fledged analytic platform in its own right; for Cloudera, the deal is a coup that will help establish its distribution as the reference. It is fully consistent with Cloudera’s goal to become the <a href="http://redhat.com/">Red Hat</a> of Hadoop as it does not aspire to spread its footprint into applications or frameworks. </p>
<p>Of course, whenever you put Oracle in the same sentence as OEM deal, the question of acquisition inevitably pops up. There are several reasons why an Oracle acquisition of Cloudera is unlikely.</p>
<p>1. Little upside for Oracle. While Oracle likes to assert maximum control of the stack, from software to hardware, its foray into <a href="http://www.oracle.com/us/technologies/linux/index.html">productizing its own support for</a> <a href="http://www.redhat.com/rhel/">Red Hat Enterprise Linux</a> has been strictly defensive; its offering has not weakened Red Hat.</p>
<p>2. Scant leverage. Compare Hadoop to <a href="http://www.mysql.com/">MySQL</a> and you have a Tale of Two Open Source projects. One is hosted and controlled by Apache, the other is hosted and controlled by Oracle. As a result, while Oracle can change licensing terms for MySQL, which it owns, it has no such control over Hadoop. Were Oracle to buy Cloudera, another provider could easily move in to fill the vacuum. The same would happen to Cloudera if, as a prelude to such a deal, it began forking from the Apache project with its own proprietary adds-ons or substitutions.</p>
<p>OEMs deals are a major stage of building the market. Cloudera has used its first mover advantage with Hadoop well with deals <a href="http://dell.com/">Dell</a>, and now Oracle. <a href="http://www.microsoft.com/">Microsoft</a> in turn has decided to keep the “competition” honest by signing up Hortonworks to (eventually) deliver the Hadoop engine for <a href="http://www.windowsazure.com/en-us/">Azure</a>.</p>
<p>OEM deals are important for attaining another key goal in developing the Hadoop market: defining the core stack – <a href="http://www.onstrategies.com/blog/2011/11/11/what-will-hadoop-be-when-it-grows-up/">as we’ve ranted</a> about previously. Just as Linux took off once a robust kernel was defined, the script will be identical for Hadoop. With <a href="http://www-01.ibm.com/software/data/infosphere/biginsights/">IBM</a> and <a href="http://www.greenplum.com/products/greenplum-hd">EMC</a>/<a href="http://gigaom.com/cloud/startup-mapr-underpins-emcs-hadoop-effort/">MapR</a> forking the Apache stack at the core file system level, and with niche providers like Hadapt offering replacement for HBase and Hive, there is growing variability in the Hadoop stack. However, to develop the third party ecosystem that will be vital to the development of Hadoop, a common target (and APIs for where the forks occur) must emerge. A year from now, the outlines of the market’s decision on what makes Hadoop Hadoop will become clear.</p>
<p>The final piece of the trifecta will be commitments from the <a href="http://www.accenture.com/us-en/pages/service-technology-large-data-warehousing.aspx">Accentures</a> and <a href="http://www.deloitte.com/view/en_US/us/Insights/Browse-by-Content-Type/deloitte-debates/ac39f372efd73310VgnVCM3000001c56f00aRCRD.htm">Deloittes</a> of the world to develop practices based on specific Hadoop platforms. For now they are still keeping their cards close to their vests.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2012/01/10/oracle-fills-another-gap-in-its-big-data-offering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Who Owns the Product Lifecycle?</title>
		<link>http://www.onstrategies.com/blog/2011/11/20/who-owns-the-product-lifecycle/</link>
		<comments>http://www.onstrategies.com/blog/2011/11/20/who-owns-the-product-lifecycle/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 02:14:06 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Application Development]]></category>
		<category><![CDATA[Product Lifecycle]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=799</guid>
		<description><![CDATA[Turn on the ignition of your car, back out of the parking space and go into drive. As you engaged the transmission, gently tapped the accelerator and stepped on the brake, you didn’t directly interact with the powertrain. Instead, your actions were detected by sensors and executed by actuators on electronics control units that then [...]]]></description>
			<content:encoded><![CDATA[<p>Turn on the ignition of your car, back out of the parking space and go into drive. As you engaged the transmission, gently tapped the accelerator and stepped on the brake, you didn’t directly interact with the powertrain. Instead, your actions were detected by sensors and executed by actuators on electronics control units that then got the car to shift, move, then stop. </p>
<p>Although in the end, <a href="http://blogs.hbr.org/cs/2010/03/no_big_quality_problems_at_toy.html">Toyota’s recall issues from 2009-10</a> wound up isolating misadjusted accelerator controls, speculation around the recalls directed the spotlight to the <a href="http://blogs.computerworld.com/15547/toyotas_lesson_software_can_be_unsafe_at_any_speed">prominent role of embedded software</a>, prompting the realization that today when you operate your car, you are <a href="http://www.cvel.clemson.edu/auto/systems/auto-systems.html">driving by wire</a>.</p>
<p>Today’s automobiles are increasingly looking a lot more like consumer electronics products. They contain nearly as much software an iPhone, and in the future will contain even more. According to IDC, the market for embedded software that is designed into engineered products (like cars, refrigerators, airplanes, and consumer electronics) <a href="http://www.cio.com/article/689563/IDC_Embedded_Systems_Market_to_Double_By_2015">will double by 2015</a>.</p>
<p>Automobiles are the tip of the iceberg where it comes to smart products; today most engineered products, from refrigerators to industrial machinery and aircraft all feature smart control. Adding intelligence allows designers to develop flexible control logic that brings more functionality to products and provides ways to optimize operation to gain savings in weight, bulk, and cost.</p>
<p>Look at the <a href="http://www.hybridcars.com/index.php">hybrid car</a>: to function, the battery, powertrain, gas and electric engines, and braking systems must all interoperate to attain fuel economy. It takes software to determine when to let the electric engine run or let the battery recharge. The degree of interaction between components is greater compared to traditional electromechanical products designs. Features such as anti-lock braking or airbag deployment depend on the processing of data from multiple sources – wheel rotation, deceleration rate, steering, etc. </p>
<p>The growth of software content changes the ground rules for product development, which has traditionally been a very silo’ed process. There are well established disciplines in mechanical and electrical engineering, with each having their own sets of tools, not to mention claims to ownership of the product design. Yet with software playing the role as the “brains” of product operation, there is the need for engineering disciplines to work more interactively across silos rather than rely on systems engineers to crack the whip on executing the blueprint.</p>
<p>We were reminded of this after a rather enjoyable, freewheeling <a href="http://www.computer.org/portal/web/webinars/Register-for-a-Webinar">IEEE webcast</a> <a href="https://event.on24.com/eventRegistration/EventLobbyServlet?target=registration.jsp&#038;eventid=373702&#038;sessionid=1&#038;key=A53B9FD15A27C492AE80579876A7F570&#038;sourcepage=register">that we had</a> with <a href="http://www.rational.com/">IBM Rational’s</a> <a href="http://www.youtube.com/watch?v=AqXEkxVWueU">Dominic Tavasolli</a> last week.</p>
<p>Traditionally, product design fell under the mechanical engineering domain, which designed the envelope and specified the geometry, components, materials, physical properties (such as resistance to different forms of stress) and determined the clearance within which electronics could be shoehorned.</p>
<p>Drill down deeper and you’ll note that each engineering domain has its full lifecycle of tools. It’s analogous to enterprise software development organizations, where you’ll often stumble across well entrenched camps of Microsoft, Java, and web programmers. Within the lifecycle there is a proliferation of tools and languages to deal with the wide variety of engineering problems that must be addressed when developing a product. Unlike the application lifecycle, where you have specific tools that handle modeling or QA, on the engineering side there are multiple tools because there are many different ways to simulate a product’s behavior in the real world to perform the engineering equivalent of QA. You might want to test mechanical designs for wind shear, thermal deformation, or compressive stresses, and electrical ones for their ability to handle voltage and disperse heat from processing units.</p>
<p>Now widen out the picture. Engineering and manufacturing groups each have their own definitions of the product. It is expressed in the <a href="http://en.wikipedia.org/wiki/Bill_of_materials">bill of materials (BOM)</a>: <a href="http://www.arenasolutions.com/manufacturing-bom.html">engineering has its own BOM</a>, which details the design hierarchy, while <a href="http://www.arenasolutions.com/manufacturing-bom.html">the manufacturing BOM</a> itemizes the inventory materials and the manufacturing processes needed to fabricate and assemble the product. That sets the stage for the question of who owns the <a href="http://en.wikipedia.org/wiki/Product_lifecycle_management">product lifecycle management (PLM)</a> process: the <a href="http://www.3ds.com/plm/">CADCAM</a> vs. the <a href="http://www.sap.com/solutions/business-suite/plm/index.epx">ERP</a> folks.</p>
<p>Into the mix between the different branches of engineering and the silos between engineering and manufacturing, now introduce the software engineers. They used to be an afterthought, yet today their programs are affecting, not only how product components and systems behave, but in many cases might impact the physical specifications. for instance, if you can design software to enable a motor to run more efficiently, the mechanical engineers can then design a smaller, lighter weight engine. </p>
<p>In the enterprise computing world, we’ve long gotten hung up on the silos that divide different parts of IT from itself – the developers vs. QA, DBAs, enterprise architects, systems operations – or IT from the business. However, the silos that plague enterprise IT are child’s play compared to the situation in product development where you have engineering groups pared off against each other, and against manufacturing.</p>
<p>OK, so the product lifecycle is a series of fiefdoms – why bother or care about making it more efficient? There is too much at stake in the success of a product: there are the constantly escalating pressures to squeeze time, defects, and cost out of the product lifecycle. That’s been the routine ever since the Japanese introduced American concepts of <a href="http://www.pqa.net/ProdServices/leanmfg/lean.html">lean manufacturing</a> back in the 1980s. But as automobiles and other complex engineered products adds more intelligence, the challenge is leveraging the rapid innovation of the software and consumer electronics industries for product sectors where, of necessity, lead times will stretch into one or more years.</p>
<p>There is no easy solution because there is no single solution. Each industry has different product characteristics that impact the length of the lifecycle and how product engineering teams are organized. Large, highly complex products such as automobiles, aircraft, or heavy machinery will have long lead times because of supply chain dependencies. At the other end of the scale, handheld consumer electronics or biomedical devices might not have heavy supply chain dependences. But, for instance, smart phones have short product lifespans and are heavily driven by the fats pace of innovation in processing power and software capabilities, meaning that product lifecycles must be quicker in order for new products to catch the market window. Biomedical devices on the other hand are often compact, but have significant regulatory hurdles to mount which impacts how the devices are tested.</p>
<p>The product lifecycle is a highly varied creature. The common thread is the need to more effectively integrate software engineering, which in turn is forcing the issue of integration and collaboration between other engineering disciplines. It is no longer sufficient to rely on systems engineers to get it together in the end – as manufacturers learned the hard way, it costs more to rework a design that doesn’t fit together, perform well, or be readily assembled with existing staff and facilities. The rapid evolution of software and processors also forces the issue on whether and where agile development processes can be coupled with linear or hierarchical development processes that are necessary for long-fuse products.</p>
<p>There is no single lifecycle process that will apply to all sectors, and no single set of tools that can perform every design and test function necessary to get from idea to product. Ultimately, the answer – as loose as it is – is that in larger product development organizations, work on the assumption that there are multiple sources of truth. The ALM and PLM worlds have at best worked warily at arms length from each other as there is a DMZ when it comes to requirements, change, and quality management. The reality is that no single constituency owns the product lifecycle – get used to federation that will proceed on rules of engagement that will remain industry- and organization-specific.</p>
<p>Ideally it would be great to integrate everything. Good luck. With the exception of frameworks that are proprietary for specific vendors, there is no associativity between tools that provides a process-level integration. The best that can be expected at this point is at the data exchange level.</p>
<p>It’s a start.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/11/20/who-owns-the-product-lifecycle/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>What will Hadoop be when it grows up?</title>
		<link>http://www.onstrategies.com/blog/2011/11/11/what-will-hadoop-be-when-it-grows-up/</link>
		<comments>http://www.onstrategies.com/blog/2011/11/11/what-will-hadoop-be-when-it-grows-up/#comments</comments>
		<pubDate>Fri, 11 Nov 2011 23:26:29 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Database]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=771</guid>
		<description><![CDATA[Hadoop World was sold out and it seemed like “For Hire” signs were all over the place –- or at least that&#8217;s what it said on the slides at the end of many of the presentations. &#8220;We&#8217;re hiring, and we&#8217;re paying 10% more than the other guys,” declared a member of the office of the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.hadoopworld.com/">Hadoop World</a> was sold out and it seemed like “For Hire” signs were all over the place –- or at least that&#8217;s what it said on the slides at the end of many of the presentations. &#8220;We&#8217;re hiring, and we&#8217;re paying 10% more than the other guys,” declared a member of the office of the CIO at JP MorganChase in a <a href="http://www.informationweek.com/news/software/info_management/231902645">conference keynote</a>. Not to mention predictions that <a href="http://siliconangle.tv/video/cube-hadoop-world-2011-day-2-break-2-part-2">there’s big money in big data</a>. Or that <a href="http://accel.com/bigdata/">Accel Partner’s announced</a> a new $100 million venture fund for big data startups; <a href="http://cloudera.com/">Cloudera </a> scored <a href="http://www.cloudera.com/company/press-center/releases/Cloudera-Nets-40-Million-in-Series-D-Funding-Round-Led-by-Ignition-Partners">$40 million in D funding</a>; and rival <a href="http://hortonworks.com/">Hortonworks</a> <a href="http://www.xydo.com/toolbar/23261203-exclusive_yahoo_hortonworks_funding_details_20m_raise_200m">previously secured $20 million</a> for Round A.</p>
<p>These are heady days. For some like <a href="http://twitter.com/#!/mjasay">Matt Asay</a> it&#8217;s time to <a href="http://www.channelregister.co.uk/2011/11/11/hadoop_funding_wars/"> voice a word of caution</a> for all the venture money pouring into Hadoop: Is the field bloating with more venture dollars than it can swallow?</p>
<p>The resemblance to Java 1999 was more than coincidental; like Java during the dot com bubble, Hadoop is a relatively new web-related technology undergoing its first wave of commercialization ahead of the buildup of the necessary skills base. We haven’t seen such a greenfield opportunity in the IT space in over a decade. And so the mood at the conference became a bit heady –– where else in the IT world today is the job scene a seller’s market?</p>
<p><a href="http://hadoop.apache.org/">Hadoop</a> has come a long way in the past year. A poll of conference attendees showed at least 200 petabytes under management. And while Cloudera has had a decent logo slide of partners for a while, it is no longer the lonely voice in the wilderness for delivering commercial distributions and enterprise support of Hadoop. Within this calendar year alone, Cloudera has finally drawn the competition to legitimize Hadoop as a commercial market. You’ve got the household names from data management and storage -– <a href="http://www-01.ibm.com/software/data/infosphere/biginsights/">IBM</a>, <a href="http://www.oracle.com/us/technologies/big-data/index.html?origref=http://www.google.com/url?sa=t&#038;rct=j&#038;q=oracle%20big%20data&#038;source=web&#038;cd=1&#038;sqi=2&#038;ved=0CDIQFjAA&#038;url=http%3A%2F%2Fwww.oracle.com%2Fus%2Fbigdata%2Findex.html&#038;ei=LFm9To6ADsTL0QGthqy8BA&#038;usg=AFQjC">Oracle</a>, <a href="http://www.greenplum.com/news/press-releases/EMC-delivers-industrys-first-unified-big-data-analytics-appliance">EMC</a>, <a href="http://arstechnica.com/business/news/2011/10/microsoft-makes-its-move-with-hadoop-on-azure-and-windows-server.ars">Microsoft</a>, and <a href="http://www.asterdata.com/resources/mapreduce.php#SQLMR">Teradata</a> &#8212; jumping in.</p>
<p>Savor the moment. Because the laws of supply and demand are going to rectify the skills shortage in Hadoop and MapReduce and the market is going to become more “normal.” Colleagues like Forrester’s <a href="http://twitter.com/#!/jameskobielus">Jim Kobielus</a> <a href="http://blogs.forrester.com/james_kobielus/11-06-08-hadoop_future_of_enterprise_data_warehousing_are_you_kidding">predict</a> Hadoop is going to enter the enterprise data warehousing mainstream; he’s also gone on record that interactive and near real-time Hadoop analytics are not far off.</p>
<p>Nonetheless, Hadoop is not going to be the end-all; with the learning curve, we’ll understand the use cases where Hadoop fits and where it doesn’t.</p>
<p>But before we declare victory and go home, we’ve got to get a better handle of what Hadoop is and what it can and should do. In some respects, Hadoop is undergoing a natural evolution that happens with any successful open source technology: there are always questions over what is the kernel and where vendors can differentiate.</p>
<p>Let’s start with the <a href="http://www.slideshare.net/davidengfer/intro-to-the-hadoop-stack-javamug">Apache Hadoop stack</a>, which <a href="http://www.cloudera.com/resource/hadoop-world-2011-presentation-slides-the-hadoop-stack-then-now-and-in-the-future">is increasingly resembling a huge brick wall</a> where things are arbitrarily stacked atop one another with no apparent order, sequence, or interrelationship. Hadoop is not a single technology or open source project but –– depending on your perspective –– an ecosystem or a tangled jumble of projects. We won&#8217;t bore you with the full list here, but Apache projects are proliferating. That&#8217;s great if you’re an open source contributor as it provides lots of outlet for innovation, but if you’re at the consuming end in enterprise IT, the last thing you want is to have to maintain a live scorecard on what&#8217;s hot and what’s not.</p>
<p>Compounding the situation, there is still plenty of experimentation going on. Like most open source technologies that get commercialized, there is the question of where the open source kernel leaves off and vendor differentiation picks up. For instance, <a href="http://mapr.com/products/apache-hadoop">MapR</a> and IBM each believe it is in the file system, with both having have their own answers to the inadequacies of the core Hadoop file system, (<a href="http://hadoop.apache.org/common/docs/current/hdfs_user_guide.html">HDFS</a>).</p>
<p>But enterprises need an answer. They need to know what makes Hadoop, Hadoop. Knowing that is critical, not only for comparing vendor implementations, but software compatibility. Over the coming year, we expect others to follow <a href="http://www.karmasphere.com/">Karmasphere</a> and create development tooling, and we also except new and existing analytic applications to craft solutions targeted at Hadoop. If that&#8217;s the case, we better know where to insist on compatibility. Defining Hadoop the way that Supreme Court justice Potter Stewart <a href="http://en.wikipedia.org/wiki/I_know_it_when_I_see_it">defined pornography</a> (“I know it when I see it”) just won&#8217;t cut it.</p>
<p>Of course, Apache is the last place to expect clarity as that&#8217;s not its mission. The Apache Foundation is a meritocracy. Its job is not to pick winners, although it will step aside once the market pulls the plug as it did when it <a href="http://adtmag.com/articles/2011/11/07/apache-harmony-project-gets-mothballed.aspx">mothballed Project Harmony</a>. That’s where the vendors come in –– they package the distributions and define what they support. What&#8217;s needed is not an intimidating huge rectangle showing a profile, but instead a concentric circle diagram. For instance, you’d think that the file system would be sacred to Hadoop, but if not, what are the core building blocks or kernel of Hadoop? Put that at the center of the circle and color it a dark red, blue, or the most convincing shade of <a href="http://www.google.com/imgres?imgurl=http://www.dataversity.net/wp-content/uploads/2011/06/ed3d1_hadoop-logo.jpg&#038;imgrefurl=http://www.dataversity.net/archives/3817&#038;h=225&#038;w=300&#038;sz=17&#038;tbnid=-a6fkoBedJyT_M:&#038;tbnh=87&#038;tbnw=116&#038;prev=/search%3Fq%3Dhadoop%2Blogo%26t">elephant yellow</a>. Everything else surrounds the core and is colored pale. We call upon the Clouderas, Hortonworks, IBMs, EMCs et al to step up the plate and define Hadoop.</p>
<p>Then there’s the question of what Hadoop does. We know what it’s done traditionally. It’s a large distributed file system that is used for offline, a.k.a., batch –– analytic runs grinding through ridiculous amounts of data. Hadoop literally chops huge problems down to size thanks a lot of things: it has a simple file structure and it brings computation directly to the data; leverages cheap commodity hardware; supports scaled-out clustering; has a highly distributed and replicated architecture; and uses the MapReduce pattern for dividing and pipelining jobs into lots of concurrent threads, and mapping them back to unity.</p>
<p>But we also caught a <a href="http://www.cloudera.com/resource/hadoop-world-2011-presentation-slides-building-realtime-big-data-services-at-facebook-with-hadoop-and-hbase">presentation</a> from Facebook’s Jonathan Grey on how Hadoop and its <a href="http://hbase.apache.org/">HBase</a> column store was adapted to real-time operation for several core applications at Facebook such as its unified messaging system, the polar opposite of a batch application. In summary, there were a number of brute force workarounds to make Hadoop and HBase more performant, such as extreme denormalization of data; heavy reliance on smart caching; and use of <a href="http://en.wikipedia.org/wiki/Inverted_index">inverted indexes</a> that point to the physical location of data, and so on. There’s little doubt that Hadoop won&#8217;t become a mainstream enterprise analytic platform until performance bottlenecks are addressed. Not surprisingly, there’s little doubt that the HBase Apache project is targeting interactivity as one of the top development goals.</p>
<p>Conversely, we also heard lots of mention about the potential for Hadoop to function as an online alternative to offline archiving. That’s fed by an architectural design assumption that Big Data analytic data stores allow organizations to analyze all the data, not just a sample of it. Organizations like Yahoo have demonstrated dramatic increases in click-through rates from using Hadoop to dissect all user interactions. That&#8217;s instead of using MySQL or other relational data warehouse that can only analyze a sampling. And the Yahoos and Googles of the world currently have no plan to archive their data –– they will just keep scaling their Hadoop clusters out and distributing them. Facebook’s messaging system –– which was used for rolling out real-time Hadoop, is also designed with the use case that old data will not be archived.</p>
<p>The challenge is that the same Hadoop cannot be all things to all people. Optimizing the same data store for interactive and online archiving is like violating the laws of gravity –– either you make the storage cheap or you make it fast. Maybe there will be different flavors of Hadoop, as data in most organizations outside the Googles, Yahoos, or Facebooks of the world is more mortal –– as are the data center budgets.</p>
<p>Admittedly, there is an emerging trend to brute force design databases for mixed workloads –– that&#8217;s the design pattern behind Oracle’s <a href="http://www.oracle.com/us/products/database/exadata-database-machine/overview/index.html?origref=http://www.google.com/url?sa=t&#038;rct=j&#038;q=exadata&#038;source=web&#038;cd=1&#038;sqi=2&#038;ved=0CEQQFjAA&#038;url=http%3A%2F%2Fwww.oracle.com%2Fus%2Fexadata%2Findex.html&#038;ei=Wam9Tr2jH-La0Q">Exadata</a>. But even Oracle’s Exadata strategy has limitations in that its design will be overkill for smaller-midsize organizations, and that is exactly why Oracle came out with the <a href="http://www.oracle.com/us/products/database/database-appliance/index.html">Oracle Database Appliance</a>. Same engine, but optimized differently. As few organizations will have Google’s IT budget, Hadoop will also have to have personas –– one size won’t fit all. And the Hadoop community –– Apache and vendor alike –– has got to decide what Hadoop’s going to be when it grows up.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/11/11/what-will-hadoop-be-when-it-grows-up/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Elegance of Steve Jobs</title>
		<link>http://www.onstrategies.com/blog/2011/10/05/the-elegance-of-steve-jobs/</link>
		<comments>http://www.onstrategies.com/blog/2011/10/05/the-elegance-of-steve-jobs/#comments</comments>
		<pubDate>Thu, 06 Oct 2011 01:17:16 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[OS/Platforms]]></category>
		<category><![CDATA[Technology Market Trends]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=765</guid>
		<description><![CDATA[Outside of politicians there are few individuals that have truly changed the way we live. It’s more than coincidental that Steve Jobs named his company after the record company of The Beatles, the group of four individuals who changed the musical tastes of our generation.
Steve jobs’ life was obviously too short, but in that short [...]]]></description>
			<content:encoded><![CDATA[<p>Outside of politicians there are few individuals that have truly changed the way we live. It’s more than coincidental that Steve Jobs named his company after the record company of The Beatles, the group of four individuals who changed the musical tastes of our generation.</p>
<p>Steve jobs’ life was obviously too short, but in that short life he crammed four public lives. He was one of the first in Silicon Valley who saw a personal future for the technology being invented there; that culminated with the Apple II. His next life introduced the GUI; after a false start with Lisa, the Mac was a fully realized system that made Apple the de facto publishing machine. It also transformed Apple into a corporation, a challenge for which Jobs was not yet prepared. His third life was NeXT, which provided the springboard for his final life #4, returning to Apple.</p>
<p>It would be an accomplishment on its own to say that Jobs returned Apple to its former glory. That’s an understatement. Under hjis (final) watch, Apple evolved from computer company, changing the way we consume music and media; significantly it was Jobs that finally got the record companies to agree on  a common pricing model. Then he redefined the mobile experience with the iPhone, and introduced a new form of computing with the iPad.</p>
<p>In so doing, Apple has changed ouir lives and changed industries. Although music downloads were going to happen regardless of the iPod, it not only made CDs obsolete, but also record stores, and arguably, albums. It also made it more accessible for garage bands everywhere to distribute and bypass the record company, a situation from which the record companies shave yet to recover. It&#8217;s also changing the nature of the phone business, and realigning major handset providers.</p>
<p>But most of all we’ll miss Steve Jobs’ sense of style. The minimalism that was Apple provided a sense of elegance and peace that cuts through the noise of our everyday lives. For that alone, thank you Steve Jobs.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/10/05/the-elegance-of-steve-jobs/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Tibco: The Best Surprise is No Surprise</title>
		<link>http://www.onstrategies.com/blog/2011/10/02/tibco-the-best-surprise-is-no-surprise/</link>
		<comments>http://www.onstrategies.com/blog/2011/10/02/tibco-the-best-surprise-is-no-surprise/#comments</comments>
		<pubDate>Mon, 03 Oct 2011 00:20:23 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[BPM]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Fast Data]]></category>
		<category><![CDATA[Middleware]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=761</guid>
		<description><![CDATA[Tibco has been running on all cylinders of late. In earnings and revenues, it has kept up with the Joneses in the enterprise software neighborhood, running respectable 25% revenue and 30+% software license growth numbers in its most recent quarterly year over year results as we’ve noted in several of our recent Ovum research notes.
It [...]]]></description>
			<content:encoded><![CDATA[<p>Tibco has been running on all cylinders of late. In earnings and revenues, it has kept up with the Joneses in the enterprise software neighborhood, running respectable 25% revenue and 30+% software license growth numbers <a href="http://www.tibco.com/multimedia/press1122_tcm8-14381.pdf">in its most recent quarterly year over year results</a> as we’ve noted in <a href="http://about.ovum.com/app/where-should-the-two-second-advantage-take-tibco-next/">several of our recent Ovum research</a> <a href="http://about.ovum.com/app/nimbus-broadens-tibcos-bpm-stable-to-appeal-to-business-users/">notes</a>.</p>
<p>It is beginning to make the turn from its geeky roots towards more solution selling to the business side in tone and deed. Ever since <a href="http://www.tibco.com/company/news/releases/2007/press810.jsp">the 2007 Spotfire acquisition</a> – which brought real-time analytic visualizations – it has made several buys that are more targeted to the business rather than strictly the IT or CIO side. They include <a href="http://www.netrics.com/">Netrics</a>, for fuzzy logic technologies for pattern matching; <a href="http://www.loyaltylab.com/m/default.aspx">Loyalty Lab</a>, for managing customer affinity programs; and Nimbus, a recent addition, which adds process discovery and management of manual activity that comprise the other 80% of what happens inside an enterprise.</p>
<p>Of course it’s not as if Tibco were trying to pull an HP in doing a 180 on its business strategy (heaven forbid, we don&#8217;t need any more <a href="http://www.cio.com/article/690304/Lane_Defends_HP_Decision_to_Name_Whitman_CEO">senseless Silicon Valley soap operas</a>!). Core infrastructure plays, such as FTL ultra low latency messaging or the DataSynapse data grid, remain core to Tibco’s 2-second advantage mission. It’s just that, in modest but growing cases, the raw technology is being packaged as a black box underneath more business-focused solutions. For instance, Tibco is packaging solutions for retail such as Active Catalog and Active Fulfillment that underneath the hood bundle <a href="http://www.tibco.com/products/business-optimization/complex-event-processing/businessevents/default.jsp">Tibco Business Events</a> (<a href="http://en.wikipedia.org/wiki/Complex_event_processing">CEP</a>), <a href="http://www.tibco.com/products/bpm/bpm-enterprise/activematrix-bpm/default.jsp">Active Matrix BPM</a>, and other pieces.</p>
<p>Of course, such transformations don&#8217;t come overnight, as there is the need to get field sales up to speed and accustomed to calling on new entry points at target prospect. Not surprisingly, Tibco is also ramping up vertical solutions, but on an opportunistic basis. An example: we met with a European telco customer that is using Business Events for monitoring devices (in this case, water meters) which may present an opportunity for Tibco to develop an <a href="http://en.wikipedia.org/wiki/Machine-to-Machine">M2M</a> (machine-to-machine) event-driven integration solution that could be more widely applied to segments such as utilities or logistics/transportation.</p>
<p>Several of <a href="http://www.enterpriseirregulars.com/17654/tibco%E2%80%99s-recent-acquisitions-datasynapse-foresight-netrics-and-spotfire/">its recent acquisitions</a>, such as Foresight, a healthcare payer EDI gateway; Open Sprit, for data integration for upstream oil and gas processing, are strictly vertical plays. Loyalty Labs, which provides analytics for customer affinity programs, has helped make retail one of its fastest growing verticals coming from a near-zero client base a few years back.</p>
<p>Tibco is traveling a similar road as IBM, but is starting from much earlier point in developing vertical solutions. As Tibco lacks the professional services presence of IBM, it has to cherry pick its vertical opportunities.</p>
<p>At this point, the major disrupters for Tibco are big data and mobility. </p>
<p>For mobile the challenge is integrating alerts from Tibco’s Business Events and Spotfire engines to clients; <a href="http://www.tibbr.com/">tibbr</a>, its internal collaboration messaging platform, provides the logical environment for bringing its events feed out to mobile devices. This could be bolstered with its recent Nimbus acquisition, both for input (process discovery, using mobile devices to snap a picture, for instance) and output (for communicating how to perform manual processes out to the field).</p>
<p>Big data positioning and productization for Tibco is also a work in progress. Its message busses can in some cases handle enormous amounts of data; its business event engine could also provide feeds if Tibco can make the sensing agent more lightweight; its BPM offering could be configured to get triggered based on specific event patterns that may involve crunching of enormous volumes of event feeds.</p>
<p>But there is a brave new world of variably structured data that is becoming fair game for enterprises to sense and respond. We don&#8217;t expect Tibco to buy its own Advanced SDQL platform or create its own Hadoop distribution, as Tibco is not about data at rest, nor is it a database player (OK, its <a href="http://en.wikipedia.org/wiki/Master_data_management">MDM</a> offering does have to store master and reference data). Nonetheless, delivering the 2-second advantage in a big world where the data is getting bigger, bigger, and more heterogeneous raises the urgency for Tibco to distinguish itself in extending its visibility.</p>
<p>When we were asked by the executive marketing team of our impressions this year, our thoughts were, well, there was hardly anything newsworthy. That’s not necessarily a bad thing, as during a strategy roadmap presentation at this year’s Tibco TUCON conference, a timeline of Tibco acquisitions showed roughly a half dozen entries for 2010 and just one for this year. Over the past year Tibco has been preoccupied with absorbing the new acquisitions and so – Nimbus excluded – has not been active on this front lately. For instance, Tibco has integrated the Netrics fuzzy pattern matching engine into Business Events, where it belongs.. It has similarly blended the recently acquired data grid technology with Business Events. <a href="http://www.column2.com/2011/09/tibco-product-strategy-with-matt-quinn/">Check out Sandy Kemsley&#8217;s</a> post for a more detailed blow-by-blow on how Tibco has rounded out its product portfolio over the past year.</p>
<p>With the swoon on Wall Street, Tibco has left its $250 cash stash alone, in spite of the fact that there are plenty of acquisition targets available at reasonable prices right now as a lot of venture funds are looking for exits. By its CFO’s words, the company is not as enormous as IBM or Oracle, where acquisitions don&#8217;t disrupt the entire company. Nonetheless, we expect that 2012 will grow more active in acquisitions – we hope that acquisition of a data quality provider makes the top of the shopping list.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/10/02/tibco-the-best-surprise-is-no-surprise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HP does a 180 – Now it’s Apotheker’s Company</title>
		<link>http://www.onstrategies.com/blog/2011/08/19/hp-does-a-180-%e2%80%93-now-it%e2%80%99s-apotheker%e2%80%99s-company/</link>
		<comments>http://www.onstrategies.com/blog/2011/08/19/hp-does-a-180-%e2%80%93-now-it%e2%80%99s-apotheker%e2%80%99s-company/#comments</comments>
		<pubDate>Fri, 19 Aug 2011 04:46:00 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[OS/Platforms]]></category>
		<category><![CDATA[Supply Chain Management]]></category>
		<category><![CDATA[Technology Market Trends]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=745</guid>
		<description><![CDATA[HP chose the occasion of its Q3 earnings call to drop the bomb. The company that under Mark Hurd’s watch focused on Converged Infrastructure, spending almost $7 billion to buy Palm, 3COM, and 3PAR, is now pulling a 180 in ditching both the PC and Palm hardware business, and making an offer to buy Autonomy, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.hp.com/">HP</a> chose the occasion of its Q3 earnings call to drop the bomb. The company that under <a href="http://www8.hp.com/us/en/company-information/executive-team/mark.html">Mark Hurd’s</a> watch focused on <a href="http://www8.hp.com/us/en/solutions/solutions-detail.html?compURI=tcm:245-785656&#038;pageTitle=converged-infrastructure&#038;contentView=business?jumpid=ex_r163_us/en/large/eb/ESN_convrginfrabr_googlesemaw&#038;k_clickid=AMS|03970dbf-436f-f089-d3a8-000073a7458f">Converged Infrastructure</a>, spending almost $7 billion to buy <a href="http://www.hp.com/hpinfo/newsroom/press/2010/100428xa.html">Palm</a>, <a href="http://www.hp.com/hpinfo/newsroom/press/2010/100412xa.html">3COM</a>, and <a href="http://www.hp.com/hpinfo/newsroom/press/2010/100902b.html">3PAR</a>, is now pulling a 180 in ditching both the PC and Palm hardware business, and <a href="http://www.hp.com/hpinfo/newsroom/press/2011/110818xc.html?mtxs=rss-corp-news">making an offer to buy</a> <a href="http://www.autonomy.com/">Autonomy</a>, one of the last major independent enterprise content management players, for roughly $11 billion.</p>
<p>At first glance, the deal makes perfect sense, given <a href="http://www8.hp.com/us/en/company-information/executive-team/apotheker.html">Leo Apotheker’s</a> enterprise software orientation. From that standpoint, Apotheker has made some shrewd moves, putting aging enterprise data warehouse brand <a href="http://www.onstrategies.com/blog/2011/01/27/hp-puts-neoview-out-of-its-misery/">Neoview</a> out of its misery, following up weeks later with the <a href="http://www.onstrategies.com/blog/2011/02/14/hps-acquisition-of-vertica-out-with-the-old-in-with-the-new/">acquisition of Advanced SQL analytics platform provider</a> <a href="http://www.vertica.com/">Vertica</a>. During the <a href="http://h30261.www3.hp.com/phoenix.zhtml?c=71087&#038;p=irol-newsArticle&#038;ID=1598003&#038;highlight=">Q3 earnings</a> call, Apotheker stated the obvious as to his comfort factor with Autonomy: “I have spent my entire professional life in software and it is a world that I know well. Autonomy is very complementary.”</p>
<p>There is potential synergy between Autonomy and Vertica, with Autonomy CEO <a href="http://www.telegraph.co.uk/finance/8710037/Profile-Mike-Lynch-Autonomy-founder.html">Mike Lynch</a> (who will stay on as head of the unit, reporting to Apotheker) that Autonomy’s user screens provide the long missing front end to Vertica, and that both would be bound by a common “information layer.” Of course, the acquisition not being final, he did not give details on what that layer is, but for now we’d assume that integration will be at presentation and reporting layer. There is clearly a lot more potential here &#8212; Vertica for now only holds structured data while Autonomy&#8217;s IDOL system holds everything else. In the long run we&#8217;d love to see federated metadata and also an extension of Vertica to handle unstructured data, just as Advanced SQL rivals like <a href="http://www.teradata.com">Teradata&#8217;s</a> <a href="http://asterdata.com">Aster Data</a> already do.</p>
<p>Autonomy, according to my Ovum colleague <a href="http://www.ovum.com/go/content/c,432,66490">Mike Davis</a> who has tracked the company for years, is one of only three ECM providers that have mastered the universal document viewer – <a href="http://www.oracle.com/us/corporate/Acquisitions/stellent/index.html">Oracle’s Stellent</a> and an Australian open source player being the others. In contrast to HP (more about that in a moment), <a href="http://www.autonomy.com/content/Investors/Releases/2011/0727f.en.html">Autonomy is quite healthy</a> with the latest quarterly revenues up 16% year over year, operating margins in the mid 40% range, and a run rate that will take the company to its first billion dollar year.</p>
<p>Autonomy is clearly a gem, but HP paid dearly for it. During Q&#038;A on the earnings call, a Wall street analyst took matters back down to earth, asking whether HP got such a good deal, given that it was paying roughly 15% of its market cap for a company that will only add about 1% to its revenues.</p>
<p>Great, expensive acquisition aside, HP’s not doing so well these days. Excluding a few bright spots, such as its <a href="https://www.fortify.com/">Fortify</a> security software business, most of HP’s units are running behind last year. Q3 net revenue of $31.2 billion was up only 1% over last year, but down 2% when adjusted for constant currency. By contrast, <a href="http://www.businesswire.com/news/home/20110718006675/en/IBM-Reports-2011-Second-Quarter-Results">IBM’s most recent results</a> were up 12% and 5%, respectively, when currency adjusted. <a href="http://twitter.com/#!/dahowlett/status/104304255346814977">Dennis Howlett tweeted</a> that it was now HP’s turn to undergo IBM&#8217;s near-death experience.</p>
<p>More specifically, HP Software was the bright spot with 20% growth year over year and 19.4% operating margin. By contrast, the printer and ink business – long HP’s cash cow – dropped 1% year over year with the economy dampening demand from the commercial side, not to mention supply chain disruptions from the Japanese tsunami. </p>
<p>By contrast, services grew only 4%, and is about to kick in yet another round of transformation. <a href="http://www.crn.com/news/channel-programs/231500315/hp-appoints-new-head-of-enterprise-services.htm">John Visenten</a>, who ran HP’s Enterprise services in the Americas region, comes in to succeed <a href="http://www8.hp.com/us/en/company-information/executive-team/livermore.html">Ann Livermore</a>. The problem is, as Ovum colleague <a href="http://www.itoamerica.com/index.php?section=bio&#038;id=662&#038;name=John+Madden">John Madden</a> states it, HP’s services “has been in a constant state of transformation” that is making some customers’ patience wear thin. Ever since acquiring EDS, HP has been trying – and trying – to raise the legacy outsourcing business higher up the value chain, with its sights literally set in the cloud.</p>
<p>The trick is that as HP tries aiming higher up the software and services food chain, it deals with a market that has longer sales cycles and long-term customer relationships that prize stability. Admittedly, when Apotheker was named CEO last fall, along with enterprise software veteran ray Lane to the board, the conventional wisdom was that HP would train its focus on enterprise software. So to that extent, HP’s strategy over the past 9 months has been almost consistent – save for earlier pronouncements on the strategic role of the tablet and <a href="http://www.hpwebos.com/us/products/software/webos2/">WebOS</a> business inherited with Palm.</p>
<p>But HP has been around for much longer than 9 months, and its latest shifts in strategy must be viewed with a longer perspective. Traditionally an engineering company, HP grew into a motley assortment of businesses. Before spinning off its geeky Agilent unit in 1999, HP consisted of test instruments, midrange servers and PCs, a token software business, and lest we forget, that printer business. Since then:<br />
•	The <a href="http://www.pcworld.com/article/60583/hp_to_buy_compaq_in_25_billion_deal.html">2001 acquisition of Compaq</a> that cost a cool $25 billion, under <a href="http://tech.fortune.cnn.com/2011/08/18/h-p-spinoff-carlys-final-goodbye/?section=magazines_fortune">Carly Fiorina&#8217;s</a> watch. That pitted it against Dell and caused HP to assume an even more schizoid personality as consumer and enterprise brand.<br />
•	Under Mark Hurd’s reign, software might have grown a bit (<a href="http://www.hp.com/hpinfo/newsroom/press/2006/060725a.html">they did purchase Mercury</a> after unwittingly not killing off their <a href="http://searchdatacenter.techtarget.com/definition/HP-OpenView">OpenView</a> business), but the focus was directed at infrastructure – storage, switches, and mobile devices as part of the Converged Infrastructure initiative.<br />
•	In the interim, HP swallowed EDS, succeeding at what it failed to do with <a href="http://www.crn.com/news/channel-programs/18818405/hp-channel-execs-speak-out-on-failed-pwc-deal.htm">its earlier ill-fated pitch for PwC</a>.</p>
<p>Then (1) <a href="http://www.oracle.com/us/corporate/press/170532http:/www.time.com/time/business/article/0,8599,2009617,00.html">Hurd gets tossed out</a> and (2) <a href="http://www.oracle.com/us/corporate/press/170532http:/www.time.com/time/business/article/0,8599,2009617,00.html">almost immediately lands at Oracle</a>; (3) <a href="http://www.oracle.com/us/corporate/press/346696">Oracle pulls support for HP Itanium servers</a>, (4) <a href="http://www.zdnet.com/blog/btl/hp-sues-oracle-over-itanium-support-does-the-lawsuit-make-sense/50822">HP sues Oracle</a>, and (5) its Itanium business sinks through the floor.</p>
<p>That sets the scene for today&#8217;s announcements that HP is “evaluating a range of options” (code speak for likely divestment) for its PC and tablet business – although it will keep WebOS on life support as its last gasp in the mobile arena. A real long shot: HP’s only hope for WebOS might be Android OEMs not exactly tickled pink about Google’s going into the handset business by buying Motorola’s mobile unit.</p>
<p>There is logical rationale for dropping those businesses – PCs have always been a low margin business in both sales and service, in spite of what it claimed to be <a href="http://www.zdnet.com/blog/btl/hps-pc-supply-chain-by-the-numbers/35698">an extremely efficient supply chain</a>. Although a third of its business, PCs were only 13% of HP’s profits, and have been declining in revenue for several years. PCs were big enough to provide a distraction and low enough margin to become a drain. And with Palm, HP gained an eloquent OS, but with a damaged brand that was too late to become the iOS alternative – Google had a 5-year headstart. Another one bites the dust.</p>
<p>Logical moves, but it’s fair to ask, what is an HP? Given HP’s twists, turns, and about-faces, a difficult one to answer. OK, HP is shedding its consumer businesses – except printers and ink because in normal times they are too lucrative – but HP still has all this infrastructure business. It hopes to rationalize all this in becoming a provider of cloud infrastructure and related services, with a focus on information management solutions.</p>
<p>As mentioned above, enterprises crave stability, yet HP’s track record over the past decade has been anything but. To be an enterprise provider, technology providers must demonstrate that they have a consistent strategy and staying power because enterprise clients don&#8217;t want to be left with orphaned technologies. To its credit, today’s announcements show the fruition of Apotheker’s enterprise software-focused strategy. But HP’s enterprise software customers and prospects need the assurance that HP won&#8217;t pull another about face when it comes time for Apotheker’s successor.</p>
<p><em>Postscript: Of course we all know how this one ended up. One good 180 deserved another. Exit Apotheker stage left. Enter Meg Whitman stage right. Reality has been reversed.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/08/19/hp-does-a-180-%e2%80%93-now-it%e2%80%99s-apotheker%e2%80%99s-company/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google and Motorola: Quick Post Mortem</title>
		<link>http://www.onstrategies.com/blog/2011/08/17/google-and-motorola-quick-post-mortem/</link>
		<comments>http://www.onstrategies.com/blog/2011/08/17/google-and-motorola-quick-post-mortem/#comments</comments>
		<pubDate>Wed, 17 Aug 2011 15:25:21 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Application Development]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[OS/Platforms]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=743</guid>
		<description><![CDATA[There’s been plenty of excellent commentary on Google’s $12.5 billion deal for Motorola Mobility Inc. (MMI) over the past few days, and we’re certainly not going to rehash covered ground.
Clearly this is a lot of money that was invested defensively. Money that could have gone into research or acquisitions that would have grown the business [...]]]></description>
			<content:encoded><![CDATA[<p>There’s been plenty of excellent commentary on <a href="http://www.google.com/intl/en/about/corporate/index.html">Google’s</a> $12.5 billion deal for <a href="http://www.motorola.com/Consumers/US-EN/Home">Motorola Mobility Inc. (MMI)</a> over the past few days, and we’re certainly not going to rehash covered ground.</p>
<p>Clearly this is a lot of money that was invested defensively. Money that could have gone into research or acquisitions that would have grown the business or opened new markets.</p>
<p>Whatever.</p>
<p>That thought hit us this morning after reading a <a href="http://www.nytimes.com/2011/08/17/technology/a-bull-market-in-tech-patents.html?hpw">NY Times piece on the bull market for patents.</a> It reinforced our thoughts after word of the deal broke: that this was money spent for arming Google against patent predators in courts of law. In this case, it’s predators sensing blood to slow down or at least exact royalties from the Android platform juggernaut.</p>
<p>Of course much of the issue stems from the subjective nature of software patents; that’s a longstanding issue given that the iterative nature of software development. It is simply difficult if not impossible to prove that a software innovation does not base itself in some way on prior invention. Furthermore, the fact that software relies on other software to operate makes the notion of software patents even more dubious.</p>
<p>This doesn&#8217;t mean that software developers should get away plagiarism. Although discovery is still underway, t<a href="http://fosspatents.blogspot.com/2011/08/oracle-v-google-update-summary-judgment.html">he evidence continues to get more damning in the Oracle-Google case over Dalvik</a>, the Android VM that on closer inspection looks like the JVM in sheep’s clothing. The irony is that when Google was still pulling its (J)VM clean room act, the company at the other end of the line was Sun. To us, this is a reflection of Google’s Not-Invented-Here mentality. Would it have killed them to secure a JVM license at the time, as they could have gotten far more reasonable terms from Sun – rather than Oracle, the new sheriff in town.</p>
<p>Just askin’.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/08/17/google-and-motorola-quick-post-mortem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>From Big to Bigger Data: First Thoughts from Teradata Influencer Summit</title>
		<link>http://www.onstrategies.com/blog/2011/08/04/from-big-to-bigger-data-first-thoughts-from-teradata-influencer-summit/</link>
		<comments>http://www.onstrategies.com/blog/2011/08/04/from-big-to-bigger-data-first-thoughts-from-teradata-influencer-summit/#comments</comments>
		<pubDate>Thu, 04 Aug 2011 07:16:02 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Database]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=736</guid>
		<description><![CDATA[It’s kind of ironic that Teradata, which actually invented the big data, data warehouse is being grilled about its big data strategy. Hold that thought.
The crux of the first day of Teradata’s Third Party Influencers conference, a kind of Vegas summer camp for selected partners and analysts, was about how Teradata is expanding its footprint [...]]]></description>
			<content:encoded><![CDATA[<p>It’s kind of ironic that <a href="http://teradata.com">Teradata</a>, which actually invented the big data, data warehouse is being grilled about its big data strategy. Hold that thought.</p>
<p>The crux of the first day of Teradata’s Third Party Influencers conference, a kind of Vegas summer camp for selected partners and analysts, was about how Teradata is expanding its footprint as it competes as an independent in an avenue of giants.</p>
<p>As part of the tour, we were given a nostalgic glimpse at a 1998 &#8211; 99 vintage slide showing Teradata’s definition of an Enterprise Data Warehouse; it’s the definition of the classic galactic enterprise storehouse that never really became the single repository of all things analytic over the years. But for organizations like Wal-Mart or eBay, it provide the core research for the big analytic problems that such businesses require.</p>
<p>Teradata has recalibrated this vision to an “Integrated Data Warehouse” which is a more realistic notion in a world that has become so interconnected to the point where it’s ridiculous to think that you can centralize wisdom in a single place. Instead, the idea is to think beyond single purpose data warehouses, not necessarily to consolidate every departmental data mart in sight, but to put together places where you might have several intersecting fonts of wisdom. For instance, in a consumer products company, you might want to stage a warehouse that covers customer and product data, because there are going to be synergies when you start doing analytics to segment your customer base, because product preferences may provide some richness to the demographics.</p>
<p>In the past year, Teradata has done a couple of acquisitions that could reshape its course going forward. Acquisition of <a href="http://aprimo.com">Aprimo</a>, an integrated marketing campaign management provider that competes with <a href="http://ibm.com">IBM</a>&#8217;s recently-acquired <a href="http://unica.com">Unica</a> places Teradata into the applications space, although – like IBM – it still positions itself as not being in the applications business. Sure, Aprimo provides Teradata a chance to sell an additional product to consumer product companies, but today’s session provided little insight as to the long-term synergies that it will provide to the mother ship.</p>
<p>As to the applications issue, well, that’s a natural issue that any vendor in the middle or data tiers has got to confront because (1) the enterprise software market continues to consolidate, and vendors can’t stand still when it comes to growing their footprint and (2) the natural direction to embed more logic in the middle or data tiers will thrust otherwise agnostic software vendors into the apps space whether they consciously intend to get there or not. </p>
<p>In Teradata’s case, it’s been gradually heading this direction for years with its vertical industry data models, so at some point, as the company strives to aim higher up the value chain, it has to add more logic that could be construed as applications. Same thing for IBM with its vertical industry oriented middleware frameworks.</p>
<p>But ironically what drew the spotlight was the plan for Teradata <a href="http://www.onstrategies.com/blog/2011/03/03/big-data-consolidation-march-contines-teradata-buys-aster-data/">and its other acquisition</a>, <a href="http://www.asterdata.com/">Aster Data</a>. Ironic because it wasn&#8217;t even on the official program today &#8212; <a href="http://www.asterdata.com/about/management.php">Tasso Argyros</a>, who co-founded Aster Data, won&#8217;t be speaking until tomorrow. It prompted questions from the peanut gallery as to how Teradata was going to get into the big data market, which prompted Teradata to throw out the challenge to those of us cynical questioners as to how would we define big data. “I hate [the term] big data,” stated Randy Lea, VP of product marketing and management, as the term has become one of those buzzwords that means all things to all people.</p>
<p>The irony of course is that Teradata’s heritage was having a platform that could house bigger data warehouses; it essentially invented the original Big Data market 30 years ago, when Big Data was measured in megabytes. But there is a different vibe to big data today, not only in volume, but the variety of forms – and some say, the velocity at which it comes in. We’d also add, it also has a different vibe when it comes to governance, whether that means archiving or dealing with privacy and confidentiality over data that was theoretically made public in a social network, but not necessarily in the context of a marketing database maintained by a third party.</p>
<p>Although parts of the briefing veered into non-disclosure territory, we still left the day with confirmation of our existing belief that in the long run there will be convergence of traditional SQL data warehouse platforms with the new Advanced SQL technologies associated with MapReduce and other capabilities that allow them to process ridiculous amounts of data, fast. We also believe that there will not only be convergence between SQL and <a href="http://labs.google.com/papers/mapreduce.html">MapReduce</a> (already happening and public with many vendors), but also with the principles of <a href="http://nosql-database.org/">NoSQL</a> data stores. From that standpoint, it was quite interesting that almost every third question from the audience was in some way related to, what will Teradata do with Aster Data?:</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/08/04/from-big-to-bigger-data-first-thoughts-from-teradata-influencer-summit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hadoop Ecosystem Starts Crystallizing</title>
		<link>http://www.onstrategies.com/blog/2011/06/06/hadoop-ecosystem-starts-crystallizing/</link>
		<comments>http://www.onstrategies.com/blog/2011/06/06/hadoop-ecosystem-starts-crystallizing/#comments</comments>
		<pubDate>Mon, 06 Jun 2011 04:15:38 +0000</pubDate>
		<dc:creator>Tony Baer</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Database]]></category>

		<guid isPermaLink="false">http://www.onstrategies.com/blog/?p=720</guid>
		<description><![CDATA[What a difference a year makes. A year ago, Big Data was an abstract concept left to the domain of a bunch of niche players and open source groups. Over the next 9 months, the Advanced SQL space dramatically consolidated as EMC, IBM, HP, and Teradata made their moves. In the past 3 months, it’s [...]]]></description>
			<content:encoded><![CDATA[<p>What a difference a year makes. A year ago, Big Data was an abstract concept left to the domain of a bunch of niche players and open source groups. Over the next 9 months, the Advanced SQL space dramatically consolidated as EMC, IBM, HP, and Teradata made their moves. In the past 3 months, it’s been <a href="http://hadoop.apache.org/">Hadoop’s</a> turn.</p>
<p>We’ve seen <a href="http://online.wsj.com/article/SB10001424052748704729304576287343518337646.html?KEYWORDS=yahoo">Yahoo flirt</a> with the idea of setting up its response to Cloudera and IBM with its own Hadoop support company, while <a href="http://greenplum.com/news/press-releases/emc-delivers-hadoop-big-data-analytics-to-the-enterprise">EMC announced ambitious but ambiguous plans</a> to – choose your term – extend or fork Hadoop. After a series of increasingly vocal hints, IBM has placed its cards on the table, while Informatica has fleshed out its plans for civilizing <a href="http://nosql-database.org/">NoSQL</a> data.</p>
<p><a href="http://ibm.com">IBM’s</a> <a href="http://www-01.ibm.com/software/data/infosphere/biginsights/">InfoSphere BigInsights</a> productizes what IBM’s been talking about for months and vocalized at its BigData analyst summit held at its Yorktown Lab (yup, the place where <a href="http://www-03.ibm.com/innovation/us/watson/?cn=agus_watson-20100712&#038;cm=k&#038;csr=google&#038;cr=jeopardy_watson&#038;ct=USJWK002&#038;S_TACT=USJWK002&#038;ck=jeopardy_watson&#038;cmp=00000&#038;mkwid=sRONsUpeR_9199316133_432n0d3749">Watson</a> <a href="http://www.jeopardy.com/minisites/watson/">played Jeopardy</a>). They’re offering the core freebie, which includes a distribution of Hadoop and the <a href="http://hadoop.apache.org/hdfs/">HDFS</a> file system, <a href="http://labs.google.com/papers/mapreduce.html">MapReduce, and integration to DB2, paid support, and an enterprise edition that adds indexing, integrated text analytics, a development studio based around <a href="http://code.google.com/p/jaql/">Jaql</a>, a SQL-like query language developed by Google that takes elements of <a href="http://hive.apache.org/">Hive</a> and <a href="http://pig.apache.org/">Pig</a>, and targets <a href="http://www.json.org/">Json</a> (the data objects of JavaScript), access control security features, and the requisite administrative console.</p>
<p>Contrary to <a href="http://greenplum.com/">EMC</a>, which hedged its words when describing if it would support Apache Hadoop, IBM came down clearly on the side of aligning its effort with the Apache projects. We shouldn’t be surprised, as IBM gave Yahoo’s VP of Hadoop development, <a href="http://developer.yahoo.com/blogs/hadoop/posts/author/eric14/">Eric Baldeschwieler</a>, the soapbox at its analyst event pleading for Hadoop not to be forked into competing technology implementations.</p>
<p><a href="http://informatica.com">Informatica</a> in turn fleshed out its big data support, which was the highlight of <a href="http://www.informatica.com/more/9-1-launch/">its 9.1 platform release being announced today</a>. While Informatica already provides the ability to extract data from Hadoop for ETL to SQL data warehouses, the 9.1 release adds new adapters for social networks LinkedIn, Twitter, and Facebook; new capabilities to connect to call detail records and image files as part of its B2B unstructured data exchange offering. More importantly, whereas before Informatica PowerCenter could extract data from Hadoop, now it can feed data back in, providing another path for tapping the power of MapReduce that might not otherwise be easily supported in your relational data warehouse.</p>
<p>This is the start of the taming of so-called “unstructured” data that populates NoSQL; in actuality, most of this data has structure, much of which has yet to be defined. Informatica’s release of social network adapters targets the lowest hanging fruit, as social media sentiment analysis has become one of the most popular use cases for building data warehouses on steroids. It couples well with text analytics, which was one of the BI market’s first forays outside the transaction world. But there are many other NoSQL data types awaiting some form of structural definition such as sensory, graph, or rich media meta data (some of this could leverage text parsing capabilities).</p>
<p>It’s still early days for commercialization of tooling for big data; while 2010 was the year that major database and platform players discovered Advanced SQL, 2011 is the point where they began directing attention at NoSQL. You can see that on the Advanced SQL side as the use cases <a href="http://www.informationweek.com/news/software/bi/229900002">are pouring out</a>. For NoSQL, and more specifically Hadoop, commercialization moves are just the first steps, <a href="http://blogs.forrester.com/james_kobielus/11-06-03-hadoop_is_it_soup_yet">as Jim Kobielus points out</a>. </p>
<p>Hadoop itself is a fairly complex ecosystem of Apache projects; saying that you support Hadoop is not the same as that for Linux because it lacks Linux’s singular nature. And different pieces of Hadoop are interchangeable: for instance, you can swap out its HBase table system for Cassandra or Cloudbase if you want something more interactive.</p>
<p>For now there is an infatuation with Hadoop, but works remains to be done for vendors to lift the burden off customers for integrating the disparate pieces.</p>
<p>Furthermore the <a href="http://ria101.wordpress.com/tag/hbase-vs-cassandra/">technology use cases</a> are only starting to be fleshed out for what to use where. Inevitably this will lend itself to a solution rather than raw database tools approach for the more popular use cases such as instant or long term social activity graph analysis for marketing, civil infrastructure management, telco churn management, and so on. Furthermore, the bigness of big data means that you might want to attack certain tasks differently. For instance, once the data is at rest, you don&#8217;t want to move it. Data governance in the NoSQL environment is still a blank slate waiting to be filled with best practices, not to mention tooling support. For instance, while Facebook data might be available by public API, will having access to that data trigger any customer privacy issues? Also, while Hadoop’s file system provides relatively low cost storage when measured per terabyte, at some point there will be need to profile, cleanse, compress, and eventually deprecate that data. Again, more white space for tooling and best practices.</p>
<p>IBM’s embrace of what otherwise appears to be an obscure query language is yet another indicator that aside from general “brand” awareness of Hadoop and MapReduce (which is a framework, not a language or technology), the target market of enterprise developers remains in learning mode and as yet lacks knowledge to choose the right tools for the job.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.onstrategies.com/blog/2011/06/06/hadoop-ecosystem-starts-crystallizing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

