Address errors encountered within the portal to ensure continuous access to the management suite. Contact your system administrator if the resolution requires infrastructure-level changes.
Performing system diagnostics
Use the built-in command-line tools on the WarehousePG Enterprise Manager (WEM) host to identify configuration errors or connectivity gaps.
Validate active configuration settings
Run the setup verification tool to ensure your current environment variables and database strings are functional:
wem setup --verifySee wem setup command reference for details.
Perform a comprehensive health audit
Check for missing dependencies, incorrect file permissions, or service-level connectivity issues:
wem doctor
See wem doctor command reference for details.
Check WEM logs
For service-level events (startup failures, restarts):
sudo journalctl -u wem -n 50 --no-pager
For application-level logs (alert evaluation, canary checks, query activity):
sudo tail -f /var/log/wem/wem.log
Connectivity issues
Error: "Failed to connect to WEM database" during setup
When running wem setup or wem setup --verify, the command fails with an error similar to:
Setup failed: failed to connect to WEM database: failed to ping database: failed to connect to `user=gpadmin database=wem`
Cause: The WHPG_HOST or WEM_HOST environment variables aren't set in the current session, or the WarehousePG database isn't running.
Solution:
Export the required host variables before running the command:
export WHPG_HOST=<coordinator-hostname> export WEM_HOST=<coordinator-hostname> wem setup --verify
If the database itself is not running, start it first:
gpstart -aThen re-run
wem setup --verify.
Issue: WEM service failed to start
Running sudo systemctl start wem returns an error:
Job for wem.service failed because the control process exited with error code. See "systemctl status wem.service" and "journalctl -xe" for details.
Cause: WEM can't reach the WarehousePG database at startup.
Solution:
Check whether WarehousePG is running:
gpstate
If the database is down, start it:
gpstart -aStart WEM again and verify the status:
sudo systemctl start wem sudo systemctl status wem
Issue: Can't connect to the database
If WEM is unable to reach the WarehousePG cluster from the portal:
Ensure the database is active and accepting local connections:
psql -d postgres -c "SELECT version();"
Verify that the WEM connection strings are correctly set in the environment:
env | grep WHPG
Use the built-in WEM tool from the WEM host to validate the current configuration:
wem setup --verifyTest the credentials directly via the CLI using the same parameters defined in the WEM Settings tab within the Management panel.
PGHOST=localhost PGUSER=gpadmin psql -d postgres -c "SELECT current_database();"
Service identity and PXF issues
Error: gpadmin OS user not found during install
When installing the WEM package, the install fails with:
ERROR: gpadmin OS user not found
Cause: The WEM host doesn't have a gpadmin OS user. WEM requires a gpadmin OS user. On WHPG hosts it already exists, but on standalone WEM hosts it must be created manually.
Solution: Create the gpadmin user, then re-run the install:
sudo useradd -r -m -d /home/gpadmin -s /bin/bash gpadmin
Issue: WEM service stays in activating (auto-restart) after upgrade
Running systemctl status wem shows activating (auto-restart) and the log shows wem refuses to start as root.
Cause: A custom service configuration override sets User=root or clears the user setting.
Solution: Remove any custom configuration files in /etc/systemd/system/wem.service.d/ that set User=root, then reload the service configuration:
sudo systemctl daemon-reload sudo systemctl restart wem
Alternatively, set WHPG_ALLOW_ROOT=1 in /etc/wem/wem.conf to acknowledge running as root.
Issue: Start PXF and Stop PXF buttons fail with Operation not permitted
The Start PXF and Stop PXF buttons on the External Tables tab return Operation not permitted.
Cause: The WEM service configuration is missing or out of date for the active PXF_BASE. The issue commonly occurs after installing PXF on a host that already has WEM, or after running pxf cluster prepare -b.
Solution: Run wem configure-pxf-sandbox to detect the active PXF_BASE and update the WEM service configuration, then restart the service to apply the change:
sudo wem configure-pxf-sandbox sudo systemctl restart wem
Authentication and access issues
Message: "Session expired"
Cause: Your security token has timed out due to a period of inactivity.
Solution: Select Log In to return to the authentication screen and re-enter your credentials.
Note
Any unsaved changes in forms or the query editor are lost upon session expiration. Regularly save your configuration changes and avoid long periods of idle time with the browser tab open.
Error: "Permission denied"
Cause: Your assigned role doesn't have the authorization required to perform the requested action.
Solution:
- Verify your current role in the top right bar.
- Review the Role permissions matrix to confirm if the action is permitted for your role.
- If you require elevated access, contact your administrator to request a role change.
Query editor restrictions
Issue: Query is blocked
Symptoms:
- "Query blocked" error messages.
- Inability to execute
INSERT,UPDATE, orDELETEstatements. - DDL commands (
CREATE,DROP) are rejected.
Cause: WEM enforces role-based SQL restrictions to prevent accidental data loss or unauthorized schema changes. Review the Role permissions matrix to confirm if the action is permitted for your role.
Observability and metrics
Issue: Charts aren't displaying
Some tabs display the error Prometheus not configured. Set PROMETHEUS_URL to enable metrics charts.
Cause: The connection to the Prometheus metrics server is either not correctly configured or down.
Solution:
- Verify that Prometheus is running and reachable via the URL defined in the Settings tab within the Management panel.
- Check network connectivity and firewall rules between the WEM server and the Prometheus endpoint.
Issue: Charts are empty despite Prometheus being configured
Prometheus is reachable but WEM charts show no data.
Cause: Metrics are not flowing from the Collector to Prometheus. The Collector may not be running, may not be configured correctly, or the ./deploy-observability script may not have been run after configuration changes.
Solution:
Check whether the metric exists in Prometheus by running the following command on the coordinator host. Replace
<prometheus-host>with the hostname from yourPROMETHEUS_ENDPOINTconfiguration:curl 'http://<prometheus-host>:9090/api/v1/query?query=warehousepg_observability_node_cpu_seconds_total'
If the response contains
"result":[], metrics are not reaching Prometheus.Inspect the Collector logs for errors:
sudo cat /var/log/whpg-observability-collector/whpg-observability-collector.log
Verify that
PROMETHEUS_ENDPOINTin/var/lib/whpg-observability-collector/collector.confis correct and that Prometheus is reachable from the coordinator.If you recently edited
collector.conf, re-run the deployment script to apply the changes:cd /var/lib/whpg-observability-collector ./deploy-observabilityIf Prometheus is running in Docker, confirm that the
--web.enable-remote-write-receiverflag is set, or that theprometheus.ymlincludesremote_writereceiver configuration. Without this flag, Prometheus silently ignores incoming metrics from the Collector.
Issue: Logs aren't loading
The Historical Logs (Coordinator & Segments) tab within the Logs panel reports the error: Loki integration is not configured.
Cause: The Loki log aggregation service is unavailable or the URL is incorrect.
Solution:
- Ensure the Loki service is active.
- Verify the Loki URL in the Settings tab within the Management panel.
- Check server-side logs for
Connection Refusederrors.