-- CH-01: WORK

Pipeline runs.

Every project in pipeline format. Input, process, output. Grouped by signal type. The full chain from source to delivery.

5 Scraping and Data Extraction 2 Data Pipelines and Automations 2 Analytics and Dashboards 2 Predictions / Modeling 2 Environment / GIS

CLUSTER 01 -- 5 PROJECTS 

Scraping and Data Extraction

[JOB 001]

Reddit Job Intelligence Platform

COMPLETE

Built a full scraping and intelligence system that monitors Reddit job communities in real time, classifies posts using NLP, and serves insights through a live dashboard. Designed to cut through noise and surface actionable job market signals.

 INPUT Reddit job communities -- dynamic content, pagination, anti-bot barriers 
 PROCESS Selenium + BeautifulSoup scraper -- LLM-assisted NLP classification -- SQL storage 
 OUTPUT Real-time Streamlit dashboard for job market intelligence 

Multi-community coverageAutomated NLP classificationLive dashboard

PythonSeleniumBeautifulSoupSQLStreamlitNLP

[JOB 002]

Sydney Commercial Property Lead Gen

COMPLETE

High-concurrency async scraper targeting Sydney commercial property developers. Extracted structured lead data from DA PDFs and leasing brochures across multiple council portals, delivering clean prospect lists for a real estate client.

 INPUT Sydney council DA portals, leasing brochures, commercial property directories 
 PROCESS Async Python + Apify actors -- pdfplumber PDF extraction -- deduplication and enrichment 
 OUTPUT Structured CSV of developer contacts with company, email, phone, and project details 

Multiple council sourcesPDF + web extractionAsync pipeline

PythonAsyncioApifypdfplumberPandas

[JOB 003]

B2B Lead Generation -- Australian Market

COMPLETE

Targeted lead list for a concrete cutting company expanding into Melbourne growth corridors. Extracted verified contacts from construction directories with strict accuracy requirements on ABN, service area, and contact validity.

 INPUT Australian construction and trade directories -- Melbourne metro focus 
 PROCESS Multi-source scraping -- field validation -- deduplication -- manual QA pass 
 OUTPUT 1,200+ verified B2B contacts with email, phone, ABN, and service area. Clean CSV delivery 

1,200+ verified contacts48hr turnaroundStrict accuracy QA

PythonBeautifulSoupPandas

[JOB 004]

Swedish Metal & Steel Company Leads

COMPLETE

Compiled a targeted email list of Swedish metal and steel manufacturers for a B2B outreach campaign. Extracted company profiles, decision-maker contacts, and verified emails from European industrial directories.

 INPUT European industrial directories and company registries -- Sweden focus 
 PROCESS Directory scraping -- company profiling -- email extraction and verification 
 OUTPUT Verified lead list with company name, industry segment, contact person, and email 

Targeted industry verticalVerified emailsEuropean market

PythonBeautifulSoupPandas

[JOB 006]

Lagos Rent Price Predictor

COMPLETE

End-to-end system: scraped 10,000+ rental listings from a JS-rendered Nigerian property platform, engineered location and property features, trained a Random Forest model, and deployed predictions via a Flask API.

 INPUT 10,000+ property listings from a JS-rendered real estate platform 
 PROCESS Multi-level scraper -- feature engineering pipeline -- Random Forest model 
 OUTPUT Flask API delivering real-time rent predictions for Lagos properties 

10,000+ listings scrapedRandom Forest modelLive API endpoint

PythonBeautifulSoupSeleniumScikit-learnFlaskPandas

CLUSTER 02 -- 2 PROJECTS 

Data Pipelines and Automations

[JOB 007]

PSX ESG Controversy Validation Pipeline

COMPLETE

Validated 480+ ESG controversy records for Pakistan Stock Exchange listed firms. Built an LLM-powered pipeline that cross-referenced claims against source data, flagged inaccuracies, and produced a fully traceable correction log.

 INPUT 500+ ESG controversy records requiring factual accuracy validation across 480 firms 
 PROCESS Apify extraction -- sequential LLM batching via OpenRouter -- quality-control checks 
 OUTPUT Validated, corrected ESG dataset with traceable error documentation 

480 firms processedLLM-validatedTraceable corrections

PythonApifyOpenRouterClaude APIPandas

[JOB 008]

Automated X (Twitter) Content Pipeline

LIVE

Built a fully automated content pipeline that generates, schedules, and publishes posts to X (Twitter). Uses Claude API for content generation, Make for orchestration, and Buffer for scheduling. Runs hands-free.

 INPUT Content prompts and topic seeds 
 PROCESS Claude API content generation -- Make scenario orchestration -- Buffer scheduling 
 OUTPUT Automated daily X posts published on schedule without manual intervention 

Fully automatedClaude API poweredHands-free publishing

MakeClaude APIBufferPython

CLUSTER 03 -- 2 PROJECTS 

Analytics and Dashboards

[JOB 009]

Restaurant Menu Profitability Analysis

LIVE

Analyzed 547,918 POS transactions ($6.2M revenue) for a restaurant chain using menu engineering methodology. Classified every menu item as Star, Plowhorse, Puzzle, or Dog. Identified $209K-$271K in annual profit improvement opportunities.

 INPUT 547,918 POS transactions totaling $6.2M in revenue 
 PROCESS Pandas ETL -- menu engineering matrix (Stars, Plowhorses, Puzzles, Dogs) -- profitability modeling 
 OUTPUT $209K-$271K projected annual profit improvement. Live Streamlit dashboard 

547K transactions$6.2M revenue analyzed$209K-$271K improvement

PythonPandasPlotlyStreamlit

[JOB 010]

Shopify Sales Performance Dashboard

COMPLETE

Built an interactive dashboard for a Shopify store processing 65,000+ transactions ($2.69M revenue). Covers KPI tracking, seasonal trends, payment method analysis, and heatmap visualizations for sales patterns.

 INPUT 65,000+ e-commerce transactions totaling $2.69M revenue 
 PROCESS Pandas ETL -- KPI computation -- seasonal trend and payment method analysis 
 OUTPUT Interactive Streamlit dashboard with heatmaps, KPI cards, and trend charts 

65K+ transactions$2.69M revenueMulti-dimensional analysis

PythonPandasPlotlyStreamlit

CLUSTER 04 -- 2 PROJECTS 

Predictions / Modeling

[JOB 011]

Customer Churn Prediction Pipeline

COMPLETE

Full ML pipeline for predicting customer churn in a telecom dataset. Automated preprocessing, feature engineering, and model training with Random Forest and Logistic Regression. Evaluated with accuracy, precision, recall, F1, and ROC-AUC.

 INPUT Telecom customer dataset with behavioral and usage features 
 PROCESS Automated preprocessing -- feature engineering -- Random Forest and Logistic Regression 
 OUTPUT Full ML pipeline with accuracy, precision, recall, F1, and ROC-AUC metrics 

Multi-model comparisonFull evaluation suiteAutomated pipeline

PythonScikit-learnPandasJupyter

[JOB 012]

Lagos Rent Price Predictor -- ML Model

COMPLETE

The modeling layer of the Lagos Rent system. Trained a Random Forest regressor on engineered features from 10,000+ scraped listings. Deployed as a Flask API for real-time price predictions based on location, size, and property type.

 INPUT 10,000+ cleaned property listings with engineered features 
 PROCESS Feature engineering -- Random Forest training -- hyperparameter tuning -- Flask deployment 
 OUTPUT Production Flask API serving rent price predictions for Lagos neighborhoods 

10K+ training samplesRandom Forest regressorProduction API

PythonScikit-learnFlaskPandas

CLUSTER 05 -- 2 PROJECTS 

Environment / GIS

[JOB 013]

Groundwater Heavy Metal Contamination Study

COMPLETE

Spatial analysis of heavy metal concentrations in groundwater samples. Mapped contamination hotspots using GIS tools, assessed health risk indices, and produced visualizations for environmental compliance reporting.

 INPUT Groundwater sampling data with heavy metal concentrations across multiple sites 
 PROCESS Spatial interpolation -- contamination mapping -- health risk index calculation 
 OUTPUT GIS maps of contamination hotspots with risk assessment documentation 

Multi-site analysisHealth risk indicesCompliance-ready output

PythonQGISGeoPandasMatplotlib

[JOB 014]

Global Electricity vs GDP Dashboard

COMPLETE

Interactive dashboard exploring the relationship between electricity consumption and GDP across countries. Visualizes development patterns, energy intensity trends, and regional comparisons.

 INPUT Global electricity consumption and GDP datasets across countries and years 
 PROCESS Data cleaning -- cross-country normalization -- trend and correlation analysis 
 OUTPUT Interactive Power BI dashboard with global maps, trend lines, and regional filters 

Global coverageMulti-year trendsInteractive filtering

Power BIPythonPandas

company	contact_person	email	phone	service_area	abn_status
Develop4u	Gaby Elsusu	[email protected]	0424 381 668	Balgowlah Heights, Seaforth	Verified
Universal Group Pty Ltd	Spiros Tsiaousis	[email protected]	0410 111 122	145 North Steyne, 54 The Outlook, 24 Prince Alfred Parade	Verified
Palladium Property	Phillip Hoare	[email protected]	0408 997 675	Campbelltown, Austral, Lane Cove, Ingleburn, Yennora	Verified
SwoopLand	Nick Marino	[email protected]	0438 882 818	Bellevue Estate (Austral), Liverpool Central, Rickard Gardens, Leppington Square, Le Vista	Verified
The Heaton Group	William Heaton	[email protected]	0412 229 942	The Archibald (Mosman), The Balgowlah, The Balmoral, ABODE (Mosman)	Verified
Kurraba Group	Lachlan Clancy	[email protected]	0408 443 117	6 active development projects (Sydney)	Verified
Clifton Lifestyle	Tracey Davis	[email protected]	1300 081 110	Bondi Junction, Sydney	Verified
D.velop.R	Matt Hall	[email protected]	0421 579 044	Caringbah, Sutherland Shire	Verified
Property Development Workshops	Jim Castagnet	[email protected]	0419 220 022	Bondi Junction — residential/mixed-use	Verified
Podia	John Melville	[email protected]	+61 409 368 541	Pyrmont NSW	Verified

company	contact_person	email	phone	service_area	abn_status
Urban Retreat Day Spa	Mel; Darren	[email protected]	+61895293333	Rockingham WA	Verified
The Healing Stone Day Spa		[email protected]	+61735010335	Brisbane QLD	Verified
MANE Day Spa	Kate	[email protected]	+61413704774	Australia	Verified
The Ritz-Carlton Spa		[email protected]	+61391222888	Melbourne VIC	Verified
Silo Day Spa		[email protected]	+61367000650	Launceston TAS	Verified
Serene Day Spa		[email protected]	+61892458188	Scarborough WA	Verified
Savoy Day Spa Hobart		[email protected]	+61362241586	Hobart TAS	Verified
Gaia Retreat & Spa		[email protected]	+61266871216	Byron Bay NSW	Verified
Canberra Day Spa		[email protected]	+61262579511	Canberra ACT	Verified
Cullen Bay Day Spa		[email protected]	+61449660958	Darwin NT	Verified

company	contact_person	email	service_area	abn_status
PSDAB		[email protected]	Sweden	Verified
Lundgren AB		[email protected]	Sweden	Verified
Petterssons Smide		[email protected]	Sweden	Verified
Grällsta Platsmide		[email protected]	Sweden	Verified
Huluhammar		[email protected]	Sweden	Verified
LPW		[email protected]	Sweden	Verified
MSSAB	Jens	[email protected]	Sweden	Verified
Monsterås Metall		[email protected]	Sweden	Verified
Sedenborgs	Lars Sedenborg	[email protected]	Sweden	Verified
Stålbröderna	Johan	[email protected]	Sweden	Verified

year	company	ticker	sector	incident_date	article_publish_date	esg_pillar	severity	incident_summary	source_outlet	source_url	controversy_score
2016	Abbott Laboratories (Pakistan) Limited	ABOT	PHARMACEUTICALS	42370	42370	G	High	Abbott Laboratories Pakistan, along with five other multinational pharmaceutical companies, increased medicine prices by 15 percent without obtaining required approval from DRAP. The unauthorized price hikes affected medicines used for cardiac ailments, blood pressure, fever, and pain relief, and caused an artificial shortage of drugs in the market.	Dawn	http://www.dawn.com/news/1238769	1
2016	Pakistan State Oil Company Limited	PSO	OIL & GAS MARKETING COMPANIES	42522	42530	G	Medium	Pakistan's OGRA offered oil marketing companies a negotiated resolution regarding disputed licence and annual fees under the new Oil Rules 2016, but the companies refused and pursued legal action, highlighting ongoing governance tensions between Ogra, the Ministry, and the oil industry.	The News International	https://www.thenews.com.pk/print/126467-Ministry-says-Ogra-responsible-for-implementing-new-oil-rules	5
2017	Tandlianwala Sugar Mills Limited	TSML	SUGAR & ALLIED INDUSTRIES	42970	42971	G	High	TSML was identified as the largest defaulter among six sugar mills owing Rs 2.65 billion to the Trading Corporation of Pakistan, with TSML's outstanding amount totaling Rs 1.15 billion. The TCP had filed court proceedings against the defaulters as revealed in an audit report to Pakistan's Public Accounts Committee.	Business Recorder	https://www.brecorder.com/news/amp/4531611	3
2017	Pakistan International Airlines Corporation Limited	PIAA	TRANSPORT	42884	42884	S	High	Two PIA officials were arrested and charged by the FIA for facilitating illegal trafficking of three Afghan women found using fraudulent boarding passes on a UK-bound flight. The investigation uncovered a broader human trafficking network, with the women having paid $20,000 to be trafficked to London.	Dawn	https://www.dawn.com/news/1336139	11
2018	Pakistan International Airlines Corporation Limited	PIAA	TRANSPORT	43227	43338	G	High	Pakistan's Auditor General recommended immediate dismissal of PIA CEO Musharraf Rasool Cyan, characterizing his appointment as an irregular favour, and called for recovery of his salary and benefits and an investigation into whether a prime ministerial adviser was involved.	Dawn	https://www.dawn.com/news/1429019	7
2018	Pakistan Petroleum Limited	PPL	OIL & GAS EXPLORATION COMPANIES	43221	43242	G	Medium	PPL was implicated in an OGRA hearing examining whether LPG producers were permitted to charge signature bonuses above the maximum notified price, following admissions by co-producer OGDCL that it had collected Rs10 billion in signature bonuses in alleged violation of orders from three high courts.	The News International	https://www.thenews.com.pk/print/320004-ogdcl-makes-rs10b-over-above-notified-price-of-lpg	2
2019	K-Electric Limited	KEL	POWER GENERATION & DISTRIBUTION	43714	43714	S	High	NEPRA found K-Electric responsible for 19 of 35 electrocution cases in Karachi during the July-August monsoon season, following a formal investigation into fatal and non-fatal incidents and prolonged power outages caused by urban flooding.	Dawn	https://www.dawn.com/news/1503894	11
2019	Sui Southern Gas Company Limited	SSGC	OIL & GAS MARKETING COMPANIES	43826	43829	S	Medium	SSGC faced criticism from multi-party political leaders in Balochistan over severe gas pressure shortages leaving residents without heating during extreme cold, despite Balochistan having supplied gas to the country since 1952.	Dawn	https://www.dawn.com/news/1525210	5
2020	JDW Sugar Mills Limited	JDWS	SUGAR & ALLIED INDUSTRIES	44099	44100	G	High	The Competition Commission of Pakistan raided JDW Sugar Mills' head office, seizing records as part of an investigation into alleged market manipulation. A joint FIA-led investigation team found sugar millers guilty of misusing public money and artificially inflating sugar prices.	Dawn	https://www.dawn.com/news/1581625	19
2020	JDW Sugar Mills Limited	JDWS	SUGAR & ALLIED INDUSTRIES	44094	44094	G	Medium	JDW Sugar Mills owner Jehangir Tareen was summoned by Pakistan's FIA as part of a sugar scam investigation. Tareen, in the UK for medical treatment, submitted a short reply requesting additional time to respond to queries regarding his businesses, assets, and sugar mills.	Dawn	https://www.dawn.com/news/1580642	19

Pipeline runs.

Scraping and Data Extraction

Reddit Job Intelligence Platform

Sydney Commercial Property Lead Gen

B2B Lead Generation -- Australian Market

Swedish Metal & Steel Company Leads

Lagos Rent Price Predictor

Data Pipelines and Automations

PSX ESG Controversy Validation Pipeline

Automated X (Twitter) Content Pipeline

Analytics and Dashboards

Restaurant Menu Profitability Analysis

Shopify Sales Performance Dashboard

Predictions / Modeling

Customer Churn Prediction Pipeline

Lagos Rent Price Predictor -- ML Model

Environment / GIS

Groundwater Heavy Metal Contamination Study

Global Electricity vs GDP Dashboard

Proof