Last updated: October 16, 2024, evening

OpenSfM is a Structure from Motion library written in Python. The library works as a processing pipeline for reconstructing camera poses and 3D scenes from multiple images. This post gives a brief overview of the repository, how to use it, and its data formats.

Introduction

OpenSfM is a Structure from Motion library written in Python. The library works as a processing pipeline for reconstructing camera poses and 3D scenes from multiple images. It consists of the basic modules of Structure from Motion (feature detection/matching, minimal solvers), with an emphasis on building a robust and scalable reconstruction pipeline. It also integrates measurements from external sensors (e.g. GPS, accelerometer) for geographic alignment and robustness. A JavaScript viewer is provided to preview the models and debug the pipeline.


Repository: https://github.com/mapillary/OpenSfM

Official website: https://opensfm.org/

Official documentation: https://opensfm.org/docs/

Usage

Environment

  • Ubuntu 20.04
  • Python 3.8

Dependencies

  • opencv: a library of computer vision algorithms
  • ceres solver: a library for solving non-linear least squares problems with bounds constraints and general unconstrained optimization problems

Building and installing from source

This route is fairly tedious and error-prone.

Download the opensfm repository

opensfm has a lot of branches, so to be safe manually check out v0.5.1:

git clone git@github.com:mapillary/OpenSfM.git
cd OpenSfM
git checkout v0.5.1
git submodule update --init --recursive

Install dependencies

Install the dependencies with the following command:

sudo apt-get install build-essential cmake libatlas-base-dev libgoogle-glog-dev \
libopencv-dev libsuitesparse-dev python3-pip python3-dev python3-numpy python3-opencv \
python3-pyproj python3-scipy python3-yaml libeigen3-dev

Install opengv following the official instructions; the concrete steps are below (-DPYTHON_INSTALL_DIR is the directory to install the Python bindings into; the paths here use Python 3.6, so adjust them to your system's Python version, e.g. 3.8 on Ubuntu 20.04):

mkdir source && cd source/
git clone --recurse-submodules -j8 https://github.com/laurentkneip/opengv.git
cd opengv && mkdir build && cd build
cmake .. -DBUILD_TESTS=OFF -DBUILD_PYTHON=ON -DPYBIND11_PYTHON_VERSION=3.6 -DPYTHON_INSTALL_DIR=/usr/local/lib/python3.6/dist-packages/
sudo make install
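
To check that the bindings ended up somewhere Python can import them from, a quick test (the opengv Python wrapper module is typically called pyopengv, and this assumes the install directory above is on your Python path):

python3 -c "import pyopengv; print(pyopengv.__file__)"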

Install ceres; you can follow these steps:

cd ../../
curl -L http://ceres-solver.org/ceres-solver-1.14.0.tar.gz | tar xz
cd ./ceres-solver-1.14.0 && mkdir build-code && cd build-code
cmake .. -DCMAKE_C_FLAGS=-fPIC -DCMAKE_CXX_FLAGS=-fPIC -DBUILD_EXAMPLES=OFF -DBUILD_TESTING=OFF
sudo make -j4 install

Install the Python requirements with pip, then build the opensfm library:

cd ../../../ && pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple -r requirements.txt
python3 setup.py build

At this point opensfm has been built successfully.
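
As a quick sanity check (my own habit, not an official step), try importing one of the compiled extension modules from the repository root; if the native build failed, the import raises an error (pygeometry is one of the pybind11 modules in recent OpenSfM versions):

python3 -c "from opensfm import pygeometry; print('native modules OK')"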

Test

From the opensfm root directory, run:

bin/opensfm_run_all data/berlin
python3 -m http.server

Open the viewer folder and select reconstruction.html, then load the file generated by the commands above, data/berlin/reconstruction.meshed.json; alternatively, find merged.ply under the undistorted folder and open it directly.
If you want to use SIFT for feature extraction, run pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple opencv-contrib-python==3.4.2.16 (the opencv-python version does not need to change).
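
To actually switch the pipeline to SIFT, set the feature type in the dataset's config.yaml. A minimal example (feature_type and feature_min_frames are standard options, listed in the default configuration further below):

# data/berlin/config.yaml
feature_type: SIFT        # default is HAHOG
feature_min_frames: 4000  # number of features to extract per image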

Summary of the full procedure

# Clone the full OpenSfM repository (2317fbb), including pybind11 and the other submodules
git clone --recursive https://github.com/mapillary/OpenSfM opensfm
# Enter the opensfm root directory
cd opensfm
git checkout v0.5.1
# Update the submodules again to make sure they are in sync
git submodule update --init --recursive
# Update the package lists
sudo apt-get update
# Install the required packages
sudo apt-get install -y \
build-essential vim curl cmake git \
libatlas-base-dev libeigen3-dev \
libgoogle-glog-dev libopencv-dev libsuitesparse-dev \
python3-dev python3-numpy python3-opencv python3-pip \
python3-pyproj python3-scipy python3-yaml
# --------- Build and install ceres ---------
# Create a temporary directory
mkdir source && cd source
# Download and extract ceres v1.14
curl -L http://ceres-solver.org/ceres-solver-1.14.0.tar.gz | tar xz
# Create the build folder
cd ceres-solver-1.14.0 && mkdir build && cd build
# cmake
cmake .. -DCMAKE_C_FLAGS=-fPIC -DCMAKE_CXX_FLAGS=-fPIC -DBUILD_EXAMPLES=OFF -DBUILD_TESTING=OFF
# Build and install with 48 threads
sudo make -j48 install
# ---------- Build and install opengv -------
# Go back to the source folder
cd ../../
# Download opengv
git clone https://github.com/paulinus/opengv.git
# Update the submodules to make sure the code is current
cd opengv && git submodule update --init --recursive
# Create the build folder
mkdir build && cd build
# cmake
cmake .. -DBUILD_TESTS=OFF \
-DBUILD_PYTHON=ON \
-DPYBIND11_PYTHON_VERSION=3.6 \
-DPYTHON_INSTALL_DIR=/usr/local/lib/python3.6/dist-packages/
# Build and install with 48 threads
sudo make -j48 install
# Install the Python libraries that opensfm needs
/usr/bin/pip3 install \
exifread==2.1.2 gpxpy==1.1.2 networkx==1.11 \
numpy pyproj==1.9.5.1 pytest==3.0.7 \
python-dateutil==2.6.0 PyYAML==3.12 \
scipy xmltodict==0.10.2 \
loky repoze.lru
# ---------- Build opensfm ----------
/usr/bin/python3 setup.py build
# Install a specific version of opencv-contrib, which makes the SIFT feature extractor available
/usr/bin/pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple opencv-contrib-python==3.4.2.16

Docker installation

I never actually got opensfm running with the complicated steps above; the environment setup is fiddly, and Docker is much simpler to use.

Download the opensfm repository

opensfm has a lot of branches, so to be safe manually check out v0.5.1:

git clone git@github.com:mapillary/OpenSfM.git
cd OpenSfM
git checkout v0.5.1
git submodule update --init --recursive

Build the Docker image

The repository includes a Dockerfile that builds the OpenSfM runtime environment; run the following command from inside the OpenSfM repository to build the image:

docker build -t sfm:vvd .

After the build succeeds, you can list the current Docker images:

> docker images
REPOSITORY   TAG   IMAGE ID       CREATED        SIZE
sfm          vvd   187a50a0777e   16 hours ago   1.84GB
...

Create a container

docker run -it --name mysfm  sfm:vvd bash

Test

bin/opensfm_run_all data/berlin

Result files are generated in the data/berlin folder; if they show up, everything went smoothly.
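
Since no volume was mounted into this container, the results stay inside it; you can copy them back to the host with docker cp (the path assumes the official Dockerfile, which puts the source tree at /source/OpenSfM):

docker cp mysfm:/source/OpenSfM/data/berlin/undistorted/depthmaps/merged.ply .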

Invoking the pipeline

Put your own image data under the data/customer/images/ folder, then invoke the SfM pipeline with:

bin/opensfm_run_all data/customer
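
opensfm_run_all is only a convenience wrapper; the same pipeline can be run step by step with bin/opensfm, which is handy when you only want to re-run one stage. The main steps, in order, are the following (recent versions also run compute_statistics and export_report at the end, which produce the stats/ folder shown in the next section):

bin/opensfm extract_metadata data/customer
bin/opensfm detect_features data/customer
bin/opensfm match_features data/customer
bin/opensfm create_tracks data/customer
bin/opensfm reconstruct data/customer
bin/opensfm mesh data/customer
bin/opensfm undistort data/customer
bin/opensfm compute_depthmaps data/customer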

Output data structure

project/
├── config.yaml
├── images/
│   └── image_filename
├── masks/
│   └── image_filename.png
├── gcp_list.txt
├── exif/
├── camera_models.json
├── features/
├── matches/
├── tracks.csv
├── reconstruction.json
├── reconstruction.meshed.json
├── undistorted/
│   ├── images/
│   │   └── image_filename
│   ├── masks/
│   │   └── image_filename.png
│   ├── tracks.csv
│   ├── reconstruction.json
│   └── depthmaps/
│       └── merged.ply
└── stats/
    ├── stats.json
    ├── report.pdf
    ├── topview.png
    ├── matchgraph.png
    ├── heatmap_XXX.png
    └── residuals_XXX.png
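
Besides the dense merged.ply produced by the depth-map stage, the sparse reconstruction can also be exported as a point cloud with the export_ply command (it writes a PLY file of the sparse points into the dataset folder):

bin/opensfm export_ply data/customer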

reconstruction.json

The main output of OpenSfM is the reconstruction, which contains the estimated camera parameters, the estimated camera positions, and the reconstructed sparse set of 3D points. These data are stored in the reconstruction.json file, whose format is as follows:

reconstruction.json: [RECONSTRUCTION, ...]

RECONSTRUCTION: {
    "cameras": {
        CAMERA_ID: CAMERA,
        ...
    },
    "shots": {
        SHOT_ID: SHOT,
        ...
    },
    "points": {
        POINT_ID: POINT,
        ...
    }
}

CAMERA: {
    "projection_type": "perspective",  # Can be perspective, brown, fisheye or equirectangular
    "width": NUMBER,                   # Image width in pixels
    "height": NUMBER,                  # Image height in pixels

    # Depending on the projection type more parameters are stored.
    # These are the parameters of the perspective camera.
    "focal": NUMBER,                   # Estimated focal length
    "k1": NUMBER,                      # Estimated distortion coefficient
    "k2": NUMBER,                      # Estimated distortion coefficient
}

SHOT: {
    "camera": CAMERA_ID,
    "rotation": [X, Y, Z],      # Estimated rotation as an angle-axis vector
    "translation": [X, Y, Z],   # Estimated translation
    "gps_position": [X, Y, Z],  # GPS coordinates in the reconstruction reference frame
    "gps_dop": METERS,          # GPS accuracy in meters
    "orientation": NUMBER,      # EXIF orientation tag (can be 1, 3, 6 or 8)
    "capture_time": SECONDS     # Capture time as a UNIX timestamp
}

POINT: {
    "coordinates": [X, Y, Z],   # Estimated position of the point
    "color": [R, G, B],         # Color of the point
}
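
Because reconstruction.json is plain JSON, it is easy to inspect from a script. Below is a minimal sketch of my own (not part of OpenSfM; only numpy is assumed) that loads the file and recovers each camera's optical center: a world point X maps to camera coordinates as R·X + t, so the camera center is -Rᵀ·t.

import json
import numpy as np

def angle_axis_to_matrix(r):
    """Rodrigues formula: angle-axis vector -> 3x3 rotation matrix."""
    theta = np.linalg.norm(r)
    if theta < 1e-12:
        return np.eye(3)
    k = np.asarray(r, dtype=float) / theta
    K = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * K + (1.0 - np.cos(theta)) * (K @ K)

with open("data/berlin/reconstruction.json") as f:
    reconstructions = json.load(f)  # a list of RECONSTRUCTION objects

for rec in reconstructions:
    print(len(rec["shots"]), "shots,", len(rec["points"]), "points")
    for shot_id, shot in rec["shots"].items():
        R = angle_axis_to_matrix(shot["rotation"])
        t = np.array(shot["translation"])
        center = -R.T @ t  # camera position in the reconstruction reference frame
        print(shot_id, center)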

Debugging the code

I don't know how other people usually get familiar with a new project; for me, when possible, stepping through the code in a debugger is the best way to learn, since it puts everything under control.

Here is how to debug inside the Docker environment:

  1. Create a container

    Mount your local image data into the container and map a range of ports for ssh:

    docker run -it --name mysfm -v ~/datasets/:/workspace -p 9000-9100:9000-9100 sfm:vvd bash
  2. Set up the environment

    Enter the container:

    docker exec -it mysfm bash

    Install ssh:

    apt update
    apt install openssh-server

    Configure ssh:

    apt install vim
    vim /etc/ssh/sshd_config

    Set the port to 9000, allow root login, and enable password authentication:

    Port 9000
    PermitRootLogin yes
    PasswordAuthentication yes

    Restart the ssh service:

    /etc/init.d/ssh restart

    Set the root password:

    passwd root

    ssh into the container:

    ssh root@ip -p 9000
  3. Connect VS Code to the container

  4. Open the /source/OpenSfM folder

  5. Create launch.json

    {
        // Use IntelliSense to learn about possible attributes.
        // Hover to view descriptions of existing attributes.
        // For more information, visit: https://go.microsoft.com/fwlink/?linkid=830387
        "version": "0.2.0",
        "configurations": [
            {
                "name": "Python Debugger: Current File",
                "type": "debugpy",
                "request": "launch",
                "program": "${file}",
                "console": "integratedTerminal",
                "justMyCode": false,
                "args": ["extract_metadata", "/workspace/yourdataset"]
            }
        ]
    }
  6. Open bin/opensfm in the editor and start the debugger; the configuration above runs the current file with the extract_metadata command on your dataset (swap in any other pipeline command as needed). An attach-style alternative is sketched after this list.
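
If you prefer launching from a shell instead of from the editor, an alternative is to start the command under debugpy inside the container and attach a VS Code "attach" configuration to it. This is only a sketch; it assumes debugpy is pip-installed in the container and that port 9001 lies inside the 9000-9100 range mapped when the container was created:

pip3 install debugpy
python3 -m debugpy --listen 0.0.0.0:9001 --wait-for-client bin/opensfm reconstruct /workspace/yourdataset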

Configuration file

Every opensfm run that produces a point cloud needs not only the raw images but also a config.yaml configuration file. The dataset layout looks like this:

lab
├── config.yaml
└── images
    ├── DJI_1_0239.JPG
    ├── DJI_1_0240.JPG
    ├── DJI_1_0242.JPG
    └── DJI_1_0268.JPG

1 directory, 5 files
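
A per-dataset config.yaml only needs to list the options you want to override; everything else falls back to the defaults shown in the next block. A sketch of the kind of overrides one might use for a drone dataset like this (the values are purely illustrative):

# lab/config.yaml
feature_process_size: 2048   # resize images before feature extraction
feature_min_frames: 8000     # extract more features per image
matching_gps_distance: 150   # only match images within 150 m of each other
depthmap_resolution: 960     # higher-resolution depth maps
processes: 8                 # number of parallel workers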

The default options of the configuration file are listed below; see opensfm.org:

# Metadata
use_exif_size: yes
default_focal_prior: 0.85

# Params for features
feature_type: HAHOG # Feature type (AKAZE, SURF, SIFT, HAHOG, ORB)
feature_root: 1 # If 1, apply square root mapping to features
feature_min_frames: 4000 # If fewer frames are detected, sift_peak_threshold/surf_hessian_threshold is reduced.
feature_process_size: 2048 # Resize the image if its size is larger than specified. Set to -1 for original size
feature_use_adaptive_suppression: no

# Params for SIFT
sift_peak_threshold: 0.1 # Smaller value -> more features
sift_edge_threshold: 10 # See OpenCV doc

# Params for SURF
surf_hessian_threshold: 3000 # Smaller value -> more features
surf_n_octaves: 4 # See OpenCV doc
surf_n_octavelayers: 2 # See OpenCV doc
surf_upright: 0 # See OpenCV doc

# Params for AKAZE (See details in lib/src/third_party/akaze/AKAZEConfig.h)
akaze_omax: 4 # Maximum octave evolution of the image 2^sigma (coarsest scale sigma units)
akaze_dthreshold: 0.001 # Detector response threshold to accept point
akaze_descriptor: MSURF # Feature type
akaze_descriptor_size: 0 # Size of the descriptor in bits. 0->Full size
akaze_descriptor_channels: 3 # Number of feature channels (1,2,3)
akaze_kcontrast_percentile: 0.7
akaze_use_isotropic_diffusion: no

# Params for HAHOG
hahog_peak_threshold: 0.00001
hahog_edge_threshold: 10
hahog_normalize_to_uchar: yes

# Params for general matching
lowes_ratio: 0.8 # Ratio test for matches
matcher_type: FLANN # FLANN, BRUTEFORCE, or WORDS
symmetric_matching: yes # Match symmetrically or one-way

# Params for FLANN matching
flann_branching: 8 # See OpenCV doc
flann_iterations: 10 # See OpenCV doc
flann_checks: 20 # Smaller -> Faster (but might lose good matches)

# Params for BoW matching
bow_file: bow_hahog_root_uchar_10000.npz
bow_words_to_match: 50 # Number of words to explore per feature.
bow_num_checks: 20 # Number of matching features to check.
bow_matcher_type: FLANN # Matcher type to assign words to features

# Params for VLAD matching
vlad_file: bow_hahog_root_uchar_64.npz

# Params for matching
matching_gps_distance: 150 # Maximum gps distance between two images for matching
matching_gps_neighbors: 0 # Number of images to match selected by GPS distance. Set to 0 to use no limit (or disable if matching_gps_distance is also 0)
matching_time_neighbors: 0 # Number of images to match selected by time taken. Set to 0 to disable
matching_order_neighbors: 0 # Number of images to match selected by image name. Set to 0 to disable
matching_bow_neighbors: 0 # Number of images to match selected by BoW distance. Set to 0 to disable
matching_bow_gps_distance: 0 # Maximum GPS distance for preempting images before using selection by BoW distance. Set to 0 to disable
matching_bow_gps_neighbors: 0 # Number of images (selected by GPS distance) to preempt before using selection by BoW distance. Set to 0 to use no limit (or disable if matching_bow_gps_distance is also 0)
matching_bow_other_cameras: False # If True, BoW image selection will use N neighbors from the same camera + N neighbors from any different camera.
matching_vlad_neighbors: 0 # Number of images to match selected by VLAD distance. Set to 0 to disable
matching_vlad_gps_distance: 0 # Maximum GPS distance for preempting images before using selection by VLAD distance. Set to 0 to disable
matching_vlad_gps_neighbors: 0 # Number of images (selected by GPS distance) to preempt before using selection by VLAD distance. Set to 0 to use no limit (or disable if matching_vlad_gps_distance is also 0)
matching_vlad_other_cameras: False # If True, VLAD image selection will use N neighbors from the same camera + N neighbors from any different camera.
matching_use_filters: False # If True, removes static matches using ad-hoc heuristics

# Params for geometric estimation
robust_matching_threshold: 0.004 # Outlier threshold for fundamental matrix estimation as portion of image width
robust_matching_calib_threshold: 0.004 # Outlier threshold for essential matrix estimation during matching in radians
robust_matching_min_match: 20 # Minimum number of matches to accept matches between two images
five_point_algo_threshold: 0.004 # Outlier threshold for essential matrix estimation during incremental reconstruction in radians
five_point_algo_min_inliers: 20 # Minimum number of inliers for considering a two view reconstruction valid
five_point_refine_match_iterations: 10 # Number of LM iterations to run when refining relative pose during matching
five_point_refine_rec_iterations: 1000 # Number of LM iterations to run when refining relative pose during reconstruction
triangulation_threshold: 0.006 # Outlier threshold for accepting a triangulated point in radians
triangulation_min_ray_angle: 1.0 # Minimum angle between views to accept a triangulated point
triangulation_type: FULL # Triangulation type : either considering all rays (FULL), or using a RANSAC variant (ROBUST)
resection_threshold: 0.004 # Outlier threshold for resection in radians
resection_min_inliers: 10 # Minimum number of resection inliers to accept it

# Params for track creation
min_track_length: 2 # Minimum number of features/images per track

# Params for bundle adjustment
loss_function: SoftLOneLoss # Loss function for the ceres problem (see: http://ceres-solver.org/modeling.html#lossfunction)
loss_function_threshold: 1 # Threshold on the squared residuals. Usually cost is quadratic for smaller residuals and sub-quadratic above.
reprojection_error_sd: 0.004 # The standard deviation of the reprojection error
exif_focal_sd: 0.01 # The standard deviation of the exif focal length in log-scale
principal_point_sd: 0.01 # The standard deviation of the principal point coordinates
radial_distorsion_k1_sd: 0.01 # The standard deviation of the first radial distortion parameter
radial_distorsion_k2_sd: 0.01 # The standard deviation of the second radial distortion parameter
radial_distorsion_k3_sd: 0.01 # The standard deviation of the third radial distortion parameter
radial_distorsion_p1_sd: 0.01 # The standard deviation of the first tangential distortion parameter
radial_distorsion_p2_sd: 0.01 # The standard deviation of the second tangential distortion parameter
bundle_outlier_filtering_type: FIXED # Type of threshold for filtering outlier : either fixed value (FIXED) or based on actual distribution (AUTO)
bundle_outlier_auto_ratio: 3.0 # For AUTO filtering type, projections with larger reprojection than ratio-times-mean, are removed
bundle_outlier_fixed_threshold: 0.006 # For FIXED filtering type, projections with larger reprojection error after bundle adjustment are removed
optimize_camera_parameters: yes # Optimize internal camera parameters during bundle
bundle_max_iterations: 100 # Maximum optimizer iterations.

retriangulation: yes # Retriangulate all points from time to time
retriangulation_ratio: 1.2 # Retriangulate when the number of points grows by this ratio
bundle_interval: 999999 # Bundle after adding 'bundle_interval' cameras
bundle_new_points_ratio: 1.2 # Bundle when the number of points grows by this ratio
local_bundle_radius: 3 # Max image graph distance for images to be included in local bundle adjustment
local_bundle_min_common_points: 20 # Minimum number of common points between images to be considered neighbors
local_bundle_max_shots: 30 # Max number of shots to optimize during local bundle adjustment

save_partial_reconstructions: no # Save reconstructions at every iteration

# Params for GPS alignment
use_altitude_tag: no # Use or ignore EXIF altitude tag
align_method: orientation_prior # orientation_prior or naive
align_orientation_prior: horizontal # horizontal, vertical or no_roll
bundle_use_gps: yes # Enforce GPS position in bundle adjustment
bundle_use_gcp: no # Enforce Ground Control Point position in bundle adjustment

# Params for navigation graph
nav_min_distance: 0.01 # Minimum distance for a possible edge between two nodes
nav_step_pref_distance: 6 # Preferred distance between camera centers
nav_step_max_distance: 20 # Maximum distance for a possible step edge between two nodes
nav_turn_max_distance: 15 # Maximum distance for a possible turn edge between two nodes
nav_step_forward_view_threshold: 15 # Maximum difference of angles in degrees between viewing directions for forward steps
nav_step_view_threshold: 30 # Maximum difference of angles in degrees between viewing directions for other steps
nav_step_drift_threshold: 36 # Maximum motion drift with respect to step directions for steps in degrees
nav_turn_view_threshold: 40 # Maximum difference of angles in degrees with respect to turn directions
nav_vertical_threshold: 20 # Maximum vertical angle difference in motion and viewing direction in degrees
nav_rotation_threshold: 30 # Maximum general rotation in degrees between cameras for steps

# Params for image undistortion
undistorted_image_format: jpg # Format in which to save the undistorted images
undistorted_image_max_size: 100000 # Max width and height of the undistorted image

# Params for depth estimation
depthmap_method: PATCH_MATCH_SAMPLE # Raw depthmap computation algorithm (PATCH_MATCH, BRUTE_FORCE, PATCH_MATCH_SAMPLE)
depthmap_resolution: 640 # Resolution of the depth maps
depthmap_num_neighbors: 10 # Number of neighboring views
depthmap_num_matching_views: 6 # Number of neighboring views used for each depthmaps
depthmap_min_depth: 0 # Minimum depth in meters. Set to 0 to auto-infer from the reconstruction.
depthmap_max_depth: 0 # Maximum depth in meters. Set to 0 to auto-infer from the reconstruction.
depthmap_patchmatch_iterations: 3 # Number of PatchMatch iterations to run
depthmap_patch_size: 7 # Size of the correlation patch
depthmap_min_patch_sd: 1.0 # Patches with lower standard deviation are ignored
depthmap_min_correlation_score: 0.1 # Minimum correlation score to accept a depth value
depthmap_same_depth_threshold: 0.01 # Threshold to measure depth closeness
depthmap_min_consistent_views: 3 # Min number of views that should reconstruct a point for it to be valid
depthmap_save_debug_files: no # Save debug files with partial reconstruction results

# Other params
processes: 1 # Number of threads to use

# Params for submodel split and merge
submodel_size: 80 # Average number of images per submodel
submodel_overlap: 30.0 # Radius of the overlapping region between submodels
submodels_relpath: "submodels" # Relative path to the submodels directory
submodel_relpath_template: "submodels/submodel_%04d" # Template to generate the relative path to a submodel directory
submodel_images_relpath_template: "submodels/submodel_%04d/images" # Template to generate the relative path to a submodel images directory

