MCP (Model Context Protocol) server for Malaysia's Open Data APIs, providing easy access to government datasets and collections.
Do note that this is NOT an official MCP server by the Government of Malaysia or anyone from Malaysia's Open Data/Jabatan Digital Negara/Ministry of Digital team.
- Enhanced Unified Search with flexible tokenization and synonym expansion
- Intelligent query handling with term normalization
- Support for plurals and common prefixes (e.g., "e" in "epayment")
- Smart prioritization for different data types
- Parquet File Support using pure JavaScript
- Parse Parquet files directly in the browser or Node.js
- Support for BROTLI compression
- Fallback to metadata estimation when parsing fails
- Automatic dashboard URL mapping for visualization
- Hybrid Data Access Architecture
- Pre-generated static indexes for efficient searching
- Dynamic API calls for detailed metadata
- Multi-Provider Geocoding
- Support for Google Maps, GrabMaps, and Nominatim (OpenStreetMap)
- Intelligent service selection based on location and available API keys
- GrabMaps optimization for locations in Malaysia
- Automatic fallback between providers
- Comprehensive Data Sources
- Malaysia's Data Catalogue with rich metadata
- Interactive Dashboards for data visualization
- Department of Statistics Malaysia (DOSM) data
- Weather forecast and warnings
- Public transport and GTFS data
- Multi-Provider Malaysian Geocoding
- Optimized for Malaysian addresses and locations
- Three-tier geocoding system: GrabMaps, Google Maps, and Nominatim
- Prioritizes local knowledge with GrabMaps for better Malaysian coverage
- Automatic fallback to Nominatim when no API keys are provided
This MCP server implements a hybrid approach for efficient data access:
- Pre-generated Static Indexes for listing and searching datasets and dashboards
- Dynamic API Calls only when specific dataset or dashboard details are requested
This approach provides several benefits:
- Faster search and listing operations
- Reduced API calls to external services
- Consistent data access patterns
- Up-to-date detailed information when needed
- TOOLS.md - Detailed information about available tools and best practices
- PROMPT.md - AI integration guidelines and usage patterns
When integrating this MCP server with AI models:
- Use the unified search tool first - Always start with
search_all
for any data queries - Follow the correct URL patterns - Use
https://data.gov.my/...
andhttps://open.dosm.gov.my/...
- Leverage Parquet file tools - Use
parse_parquet_file
to access data directly orget_parquet_info
for metadata - Use the hybrid approach - Static indexes for listing/searching, API calls for details
- Consider dashboard visualization - For complex data, use the dashboard links provided by
find_dashboard_for_parquet
- Leverage the multi-provider Malaysian geocoding - For Malaysian location queries, the system automatically selects the best provider (GrabMaps, Google Maps, or Nominatim) with fallback to Nominatim when no API keys are configured
Refer to PROMPT.md for comprehensive AI integration guidelines.
npm install
To run the MCP server in development mode:
npx @smithery/cli dev
To build the MCP server for deployment:
npx @smithery/cli build
This MCP is designed to be deployed to Smithery. Follow these steps to deploy:
-
Make sure you have the Smithery CLI installed:
npm install -g @smithery/cli
-
Build the project:
npx @smithery/cli build
-
Deploy to Smithery:
npx @smithery/cli deploy
list_datasets
: Lists available datasets in the Data Catalogueget_dataset
: Gets data from a specific dataset in the Data Cataloguesearch_datasets
: Searches for datasets in the Data Catalogue
list_dosm_datasets
: Lists available datasets from DOSMget_dosm_dataset
: Gets data from a specific DOSM dataset
parse_parquet_file
: Parse and display data from a Parquet file URLget_parquet_info
: Get metadata and structure information about a Parquet filefind_dashboard_for_parquet
: Find the corresponding dashboard URL for a Parquet file
get_weather_forecast
: Gets weather forecast for Malaysiaget_weather_warnings
: Gets current weather warnings for Malaysiaget_earthquake_warnings
: Gets earthquake warnings for Malaysia
list_transport_agencies
: Lists available transport agencies with GTFS dataget_transport_data
: Gets GTFS data for a specific transport agency
parse_gtfs_static
: Parses GTFS Static data (ZIP files with CSV data) for a specific transport providerparse_gtfs_realtime
: Parses GTFS Realtime data (Protocol Buffer format) for vehicle positionsget_transit_routes
: Extracts route information from GTFS dataget_transit_stops
: Extracts stop information from GTFS data, optionally filtered by route
hello
: A simple test tool to verify that the MCP server is working correctly
const result = await tools.get_weather_forecast({
location: "Kuala Lumpur",
days: 3
});
const result = await tools.search_datasets({
query: "population",
limit: 5
});
// Parse GTFS Static data
const staticData = await tools.parse_gtfs_static({
provider: "ktmb"
});
// Get real-time vehicle positions
const realtimeData = await tools.parse_gtfs_realtime({
provider: "prasarana",
category: "rapid-rail-kl"
});
// Get transit routes
const routes = await tools.get_transit_routes({
provider: "mybas-johor"
});
// Get stops for a specific route
const stops = await tools.get_transit_stops({
provider: "prasarana",
category: "rapid-rail-kl",
route_id: "LRT-KJ"
});
Please be aware of rate limits for the underlying APIs. Excessive requests may be throttled.
src/index.ts
: Main MCP server implementation and tool registrationsrc/datacatalogue.tools.ts
: Data Catalogue API toolssrc/dashboards.tools.ts
: Dashboard access and search toolssrc/dosm.tools.ts
: Department of Statistics Malaysia toolssrc/unified-search.tools.ts
: Enhanced unified search with tokenization and synonym expansionsrc/parquet.tools.ts
: Parquet file parsing and metadata toolssrc/weather.tools.ts
: Weather forecast and warnings toolssrc/transport.tools.ts
: Transport and GTFS data toolssrc/gtfs.tools.ts
: GTFS parsing and analysis toolssrc/flood.tools.ts
: Flood warning and monitoring toolsDockerfile
: Docker configuration for Smitherysmithery.yaml
: Smithery configurationpackage.json
: Project dependencies and scriptstsconfig.json
: TypeScript configuration
To test locally before deploying to Smithery:
# Start the development server
npm run dev
# Or build and run the production version
npm run build
npm start
# In another terminal, test the hello tool
curl -X POST http://localhost:8182/invoke/hello -H "Content-Type: application/json" -d "{}"
You can also use the Smithery CLI for local development:
# Run in development mode
npx @smithery/cli dev
# Build for production
npx @smithery/cli build
If you encounter deployment issues:
- Ensure your GitHub repository is public or properly connected to Smithery
- Verify that your
Dockerfile
andsmithery.yaml
are in the repository root - Check that the
index.js
file exports a valid MCP server function
This project supports the following configuration options:
Geocoding Credentials (Optional. Only for GTFS Transit Features Usage):
The following credentials are only needed if you plan to use the GTFS transit tools that require geocoding services. Other features like data catalogue access, weather forecasts, and DOSM data do not require these credentials.
- googleMapsApiKey: Optional. If provided, the system will use Google Maps API for geocoding location names to coordinates.
- grabMapsApiKey: Optional. Required for GrabMaps geocoding, which is optimized for locations in Malaysia.
- awsAccessKeyId: Required for GrabMaps integration. AWS access key for GrabMaps API authentication.
- awsSecretAccessKey: Required for GrabMaps integration. AWS secret key for GrabMaps API authentication.
- awsRegion: Required for GrabMaps integration. AWS region for GrabMaps API (e.g. 'ap-southeast-5' for Malaysia region or ap-southeast-1 for Singapore region).
If neither Google Maps nor GrabMaps API keys are provided, the GTFS transit tools will automatically fall back to using Nominatim (OpenStreetMap) API for geocoding, which is free and doesn't require credentials.
You can set these configuration options in two ways:
- Through Smithery's configuration interface when connecting to the MCP server
- As environment variables (GOOGLE_MAPS_API_KEY, GRAB_MAPS_API_KEY) for local development
For local development:
The project uses dotenv
to load environment variables from a .env
file during development.
- Create a
.env
file in the root directory with the following content:
GOOGLE_MAPS_API_KEY=your_google_api_key_here
GRAB_MAPS_API_KEY=your_grab_api_key_here
AWS_ACCESS_KEY_ID=your_aws_access_key_for_grabmaps
AWS_SECRET_ACCESS_KEY=your_aws_secret_key_for_grabmaps
AWS_REGION=ap-southeast-5 # Malaysia region or ap-southeast-1 # Singapore region
- The variables will be automatically loaded when you run the server locally using
npm run dev
For Smithery deployment:
When connecting to your MCP server through Smithery:
- Click on "Connect Malaysia Open Data MCP Server"
- You'll see configuration options for:
googleMapsApiKey
- Google Maps API keygrabMapsApiKey
- GrabMaps API keyawsAccessKeyId
- AWS access key for GrabMapsawsSecretAccessKey
- AWS secret key for GrabMapsawsRegion
- AWS region for GrabMaps (e.g. ap-southeast-5 for Malaysia region or ap-southeast-1 for Singapore region)
- Enter your API keys and AWS credentials in these fields
- Click "Get Link" to generate your connection URL
The API keys will be securely passed to the server during connection.
Note: For Malaysian locations, GrabMaps provides the most accurate geocoding results, followed by Google Maps, with both requiring API keys. If you don't provide either API key, the system will automatically use Nominatim API instead, which is free but may have less accurate results for some locations in Malaysia.
Important: These geocoding credentials are only required for the following GTFS transit tools:
get_transit_routes
- When converting location names to coordinatesget_transit_stops
- When converting location names to coordinatesparse_gtfs_static
- When geocoding is needed for stop locations
Note about GTFS Realtime Tools: The parse_gtfs_realtime
tool is currently in development and has limited availability. Real-time data access through this MCP is experimental and may not be available for all providers or routes. For up-to-date train and bus schedules, bus locations, and arrivals in real-time, please use official transit apps like Google Maps, MyRapid PULSE, Moovit, or Lugo.
All other tools like data catalogue access, dashboard search, weather forecasts, and DOSM data do not require any geocoding credentials.
MIT - See LICENSE file for details.
- Malaysia Open Data Portal
- Department of Statistics Malaysia
- Malaysian Meteorological Department
- Google Maps Platform for geocoding
- GrabMaps for geocoding
- Nominatim for geocoding
- Smithery for the MCP framework