Semprini

Lies, damn lies and statistics

Entries for category "3. Data - BI/Analytics/Science"

There and back again - Analytics Data Pipeline

As an architect, it's my job to talk and write a lot of bollocks. One must constantly be on the forefront of buzzword lore to maintain one's architectural standing and feeling of general superiority over lesser IT peons.

However, in this obviously correct and proper feudal type system, what sometimes gets lost or confused is the practicality of implementation and shared understanding. The same architectural buzzwords can be interpreted in so many ways that it becomes difficult for the people implementing to realise the actual benefit. This means uninspired doers either take no notice or go off on a tangent which loses any architectural benefit intended.

This is why I'm not a fan of ivory tower architects and IT leaders who shoot Google arrows and Gartner bolts from the parapet, and I prefer to wield a bloody big sword in the melee of IT implementation. This blog is about a quick PoC I had a bash at for a pipeline which streams from a simulated on-prem source to a data lake-house.

Data Convergence

There is so much scope for change in IT. It's quite a pure form of expression of ideas because its underlying logic gates have been virtualized and almost completely abstracted. There are no laws of physics that apply.

But Semprini, you wizened lothario of technology, why then is the pace of change at most companies ever slower?

Excellent question, my intrepid reader. Let's dive into this and hopefully even plausibly relate it to the topic of the blog - data convergence.

Tikanga Data

A language can say a lot about a culture. All words are made up, therefore the things and ideas that have evolved to have their own words can provide some insight to the nature of a society.

"Tikanga is an inner form of life that manifests itself in one’s conduct. Good intention is embodied in character traits so the philosophy is neither pragmatism nor materialism - it's the character of a person which is given a primary place in virtue ethics." - Piripi Whaanaga. Related to Tikanga is the concept of Manaaki, which is derived from the words ‘mana’ (prestige) and ‘aki’ (to encourage). Thus an important component of restoring balance is encouraging or building up mana.

This seems like a good place to start for an ethics framework for data.

Model Driven Generation

Where is the truth of an organisation? Many programmers would say that the truth is in the code as this is what is actually running the organisation. The code is in effect a realization of business demand and therefore the truth of what an organisation actually is can be found within.

I think that, while somewhat accurate, this is a little 'ambulance at the bottom of the cliff' thinking, and we should rather strive to be declarative about what the organisation is. Also, assuming we buy rather than build for commodity capabilities, the view from the code is obscured and incomplete. The declaration of a business is best done via modelling, as the related views can inform and align everyone from the board to the developers.
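To make "declarative" a bit less hand-wavy, here is a minimal sketch of the general idea: the model is plain data, and an artefact (here a SQL table definition) is generated from it. Everything in it is an assumption for illustration only. The `Entity` and `Attribute` classes, the `generate_ddl` function and the `customer` example are hypothetical names, not taken from any particular modelling tool.

```python
from dataclasses import dataclass, field


@dataclass
class Attribute:
    """One attribute of a business entity (hypothetical model element)."""
    name: str
    sql_type: str
    required: bool = True


@dataclass
class Entity:
    """A business entity declared in the model (hypothetical model element)."""
    name: str
    attributes: list = field(default_factory=list)


def generate_ddl(entity: Entity) -> str:
    """Generate a CREATE TABLE statement from the declared entity."""
    columns = []
    for attr in entity.attributes:
        null_clause = " NOT NULL" if attr.required else ""
        columns.append(f"    {attr.name} {attr.sql_type}{null_clause}")
    body = ",\n".join(columns)
    return f"CREATE TABLE {entity.name} (\n{body}\n);"


# The model is the declaration; the DDL is just one generated view of it.
customer = Entity(
    "customer",
    [
        Attribute("id", "INTEGER"),
        Attribute("name", "TEXT"),
        Attribute("email", "TEXT", required=False),
    ],
)
ddl = generate_ddl(customer)
```

The point of the sketch is the direction of travel: change the model and regenerate, rather than digging the truth back out of the code. The same declaration could just as plausibly feed an API schema or documentation.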