Accurate three-dimensional (3D) localization is critical for robust human-robot collaboration (HRC) in dynamic indoor environments. However, realizing high-precision localization in complex scenarios still faces challenges such as multipath effects, field-of-view occlusion, etc. To address these limitations, we propose Geo-LSTM, a geometry-constrained long short-term memory (LSTM) framework that integrates ultra- wideband (UWB) sensors, inertial measurement unit (IMU), and barometric pressure (BMP) sensors. First, a Simplified Geometric Localization (SGL) algorithm is proposed, which uses dual-BMP sensors and IMU sensor to obtain precise height information and utilizes the geometric relationships between the UWB tag and anchors to compute an initial location estimate, serving as a priori input for the Geo-LSTM network. This Geo-LSTM algorithm then incorporates multi-source geometric information to extract time-series features from the UWB ranging data and the tag's a priori location, further enhancing 3D localization accuracy. The experimental results from the cluttered indoor environments, including real-world HRC tasks with occlusions, show that the Geo-LSTM algorithm achieves an average 3D localization root mean square error (RMSE) of 0.103 m, representing improvements of 38.60% and 31.20% over the weighted least squares (WLS) method and the range-based LSTM algorithm, respectively. These results demonstrate Geo-LSTM's potential for reliable multi-sensor 3D localization in HRC applications.