Services
I offer professional services around web crawling, data engineering and open source software. My work combines academic rigour with hands-on engineering and an open source mindset.
These services are provided through my software company, Colorful Bytes. Get in touch via Colorful Bytes or by email to discuss how I can help.
Talks & Speaking
Talks for conferences, meetups and corporate events on topics such as:
- Building scalable, focused web crawlers (e.g. with Apache Storm and StormCrawler).
- AI-assisted software engineering: how AI is reshaping the way we build software.
- Mining the web: information retrieval and text mining at scale.
- Working in and with the Apache Software Foundation: open source done in the open.
Training & Workshops
Hands-on, interactive sessions for engineering teams:
- AI-assisted software engineering: using AI tooling effectively across the development workflow.
- Designing and operating distributed web crawling architectures.
- Building and optimising big-data pipelines for web-scale data.
- Contributing to and sustaining open source projects (community, governance, release process).
Advisory & Consulting
Strategic and technical guidance, with a vendor-neutral perspective:
- Architecture reviews for web crawling and data-extraction systems.
- Scaling, reliability and cost optimisation of data pipelines.
- Adopting AI-assisted software engineering: tooling, workflows and team practices.
- Open source strategy: tooling choices, licensing and engagement with upstream communities.
Open Source
As an ASF Member and PMC Chair of Apache Storm and Apache StormCrawler, I help organisations adopt, integrate and contribute back to open source projects. See the Open Source page for the projects I am involved with.