- Originally Posted by Tony Cerqueira on Wed, Dec 02, 2009 @ 06:02 PM
The New Yorker Magazine has is a great read for anyone considering the strategic aspects of real-time versus batch processes, (from databases, to running a girl’s basketball team) in this article titled “How David Beats Goliath: When underdogs break the rules“.
In the world of storage and information management and protection, the parallels to current legacy point products are impressive. Today’s leading backup products reside completely upon legacy architectures. They are still, by and large, run as batch processes, are not searchable, do not provide real-time differencing, and have no real-time capability to tie into other data movement or data management capabiliites. You could say many of the same things about many other tools used within the IT dept.
It would be nice to turn a key, and make it all real-time, but that won’t happen. Fundamentally, it requires changes in the way systems, physical or Virtual Machines, are managed, and how responsibilities are distributed (if they are). The legacy Client/Servers approaches completely rely on outdated policy distribution communications (batch), where connectivity must remain intact to execute their “server -to- slave server -to- media server -to- client” laundry list of batch “stuff to do”. They need a lot of hand holding in order for things to happen, and for policies to be executed. A short list of issues with legacy products:
o- Batch methods require scans, trawls, polls, etc., all of which drag down resources
o- Batch I/O stacks up fast on VMs, and goes medieval on their host systems
o- Data changes can be discerned, but data touches cannot be tracked
o- Data classification, if any, is after the fact, instead of at “point of creation”
o- Compliance is via “batch” time slices, not real world “second-by-second” views
o- Metadata consistency is always a day late and a dollar short
o- Repository data always has a “window” of difference with primary data
o- Deduplication remains after the fact and separate
If users want to explore the road of real-time, they will need to seek new solutions that are outside of the realm of their current vendor portfolio, because vendor leaders just have too much invested in existing legacy code bases. New architectures, which provide self-managing nodes, together with scalable and distributed storage, are the key to deploying more value across the enterprise, on a granular, simple and cost effective basis. . . . Did I mention . . uhm . . . AIMstor?

