The present invention relates to an adaptive mobile manipulation apparatus and method, and more particularly to an adaptive mobile manipulation apparatus and method utilizing a plurality of markers.
Material handling and logistics are two important tasks in warehouses and factories. These tasks are usually performed by manpower, which leads to safety risks and operation costs. With the development of mobile manipulator, it is now possible to automate these tasks. However, challenges remain.
The first challenge is the navigation of a mobile manipulator. With the help of LASER range scanners, or LIDAR, and advanced control algorithms, an Automated Guided Vehicle (AGV) is now able to move to a target location. However, the accuracy is about 10 centimeters in position and about 10 degrees in orientation.
The second challenge is the localization of the target object or manipulating area, which involves estimating its pose, including the position and the orientation thereof. Techniques such as computer vision and machine learning are capable of doing this in limited conditions such as good lighting. However, due to the placement of camera(s) on the mobile manipulator and various lighting conditions in the warehouse or factory, the outcome is not stable. Besides, these techniques are computationally intensive and not suitable for a mobile manipulator, which has limited battery power and computational power. If the manipulation scenario changes, the mathematical models also have to be re-established. Techniques using squared planar fiducial markers such as ArUco and ALVAR are popular methods used to detect the markers' poses by placing the marker on the object. Given the size of a square marker, the position and orientation of the marker, namely, the pose of the marker, can then be determined by its size and shape in the camera image. The resulting position estimation is accurate (usually within one or two millimeters), but the orientation estimation heavily depends on environmental conditions such as lighting and fluctuates within a short period of time.
The third challenge is about motion planning. This includes moving the mobile manipulator to a specific location and using the manipulator to perform manipulation task. Traditionally, “teaching” is the technique used on production line for a fixed manipulator to perform repetitive tasks such as pick and place, screwing. An engineer guides and programs the manipulator through a sequence of movements that represent the task. However, due to the position and orientation errors from moving the mobile platform (AGV), there exists a position offset and an orientation offset from the manipulator to the target and this makes traditional “teaching” technique not suitable for a mobile manipulator.
In addition to the above-mentioned challenges, artificial intelligence and machine learning are popular techniques for solving the above-mentioned problems in academic research. However, it can be practically infeasible for small businesses to have a team of researchers focused on this due to the financial cost. It is appropriate to provide a low-cost framework to solve this problem.
Therefore, there is a need of providing an adaptive mobile manipulation apparatus and method distinct from prior art in order to solve the above drawbacks.
The present disclosure provides an adaptive mobile manipulation apparatus and method in order to overcome at least one of the above-mentioned drawbacks.
The present disclosure also provides an adaptive manipulation method which classifies the actions for object manipulation into pose-aware actions and non-pose-aware actions and further associates the pose-aware actions with localization information obtained by detecting the markers, and thus, the pose-aware actions with high accuracy can be achieved through a low-cost framework of an adaptive mobile manipulation apparatus.
In accordance with an aspect of the present disclosure, an adaptive manipulation method is provided. The adaptive manipulation method includes steps of providing a mobile manipulation apparatus including a manipulator, a sensor and a processor for a manipulation of an object placed on a carrier having a plurality of markers spaced apart from each other, providing a base-case motion plan including a plurality of first pose-aware actions, the sensor detecting the plurality of markers to obtain a run time marker information, the processor, according to the base-case motion plan, generating a run time motion plan, wherein the run time motion plan includes a plurality of second pose-aware actions, and the plurality of second pose-aware actions are modified from the plurality of first pose-aware actions according to the run time marker information, and the processor further executing the run time motion plan for controlling the manipulator to manipulate the object.
In an embodiment, each of the first pose-aware actions of the base-case motion plan includes variables and a base-case marker information corresponding to the plurality of markers.
In an embodiment, the method further includes steps of the processor calculating a difference between the base-case marker information and the run time marker information, and the processor generating the plurality of second pose-aware actions according to the plurality of first pose-aware actions and the difference.
In an embodiment, both the run time marker information and the base-case marker information include positions and orientations between the plurality of markers and the sensor.
In an embodiment, the manipulator further includes an end effector and a joint. The first and the second pose-aware actions include moving the end effector by position and orientation relative to the object. The first and the second pose-aware actions respectively further include at least one of the following: moving the end effector to a target pose, traversing the end effector through a trajectory, and moving the end effector associating with the run time marker information.
In an embodiment, the object is placed at a fixed location on the carrier. In another embodiment, the markers include visual markers or fiducial markers. In further another embodiment, the sensor includes a camera.
In accordance with another aspect of the present invention, an adaptive manipulation apparatus is provided. The adaptive mobile manipulation apparatus includes a manipulator, a sensor, and a processor. The processor is coupled to the manipulator and the sensor, and configured to perform the following steps: retrieving a base-case motion plan including a plurality of first pose-aware actions, driving the sensor to detect a plurality of markers located on a carrier to obtain a run time marker information, according to the base-case motion plan, generating a run time motion plan, wherein the run time motion plan includes a plurality of second pose-aware actions, and the plurality of second pose-aware actions are modified from the plurality of first pose-aware actions according to the run time marker information, and executing the run time motion plan for controlling the manipulator to manipulate an object placed on the carrier.
The above contents of the present disclosure will become more readily apparent to those ordinarily skilled in the art after reviewing the following detailed description and accompanying drawings, in which:
The present disclosure will now be described more specifically with reference to the following embodiments. It is to be noted that the following descriptions of preferred embodiments of this disclosure are presented herein for purpose of illustration and description only. It is not intended to be exhaustive or to be limited to the precise form disclosed.
The present disclosure is to provide a framework for object manipulation (picking up, placing, or modifying objects etc.) in the settings of warehouses or factory production lines, such that an engineer or operator can easily design a motion plan with affordable financial cost.
Four parts of the details of the present disclosure, including (1) the design of system, (2) the architecture of an adaptive manipulation apparatus, (3) the design of a teaching-based adaptive mobile manipulation, and (4) the algorithms used to obtain localization information from multiple markers, will be described as following.
(1) The Design of System
The system includes the physical setup of the environment, including an adaptive mobile manipulation apparatus, a carrier for placing a target object, and markers placed on the carrier and spaced apart from each other. Generally, the carrier is the rigid body shelf in the warehouse or factory and different shelves are distinguished by their identification numbers, i.e. shelf ID. Please refer to
The carrier 121 is specially designed to house the target object 122. Hence, it is assumed that the relative poses among the carrier 121, the target object 122, the collision objects 123 and the markers 124, 125 are fixed. The poses of other components can be calculated from the pose of the carrier 121 if the latter is known. Two markers 124, 125 are positioned horizontally on the carrier 121 facing approximately in the same direction at the approximately same height from the ground 101. For best result, it is suggested that (a) the marker is at least 35 mm in size, (b) additional white-color border of at least 3 mm wide surrounding each marker, and (c) 100 mm apart from each other measured from their centers.
Although there are only one adaptive mobile manipulation apparatus and only one carrier are shown in
The objective of a “manipulation task” is to move the mobile manipulation apparatus to a location near the carrier 121 and perform manipulation at the target object 122. Hence, before performing manipulation, the AGV component 111 of the adaptive mobile manipulation apparatus moves to a pose nearby the carrier 121 with a specific pair of markers within the effective view volume 115 of the sensor 114 of the mobile manipulation apparatus, and the target object 122 is within the reach of the manipulator 112 using existing navigation techniques.
Accordingly, the following reasonable assumptions are made. Firstly, the manipulation task is divided into two steps—navigation (using the AGV component 111) and manipulation (using the manipulator 112). Secondly, the AGV component 111 is able to navigate to a target position and orientation accurately enough such that the target object 122 is within the reach of the manipulator 112. However, a margin of error (position and orientation offsets) is allowed. Lastly, the target object 122 is placed on a specially designed carrier 121 and hence the related poses of the target object 122 are fixed with the carrier 121. In other words, with the knowledge of the pose of the carrier 121, the pose of the target object 122 can be calculated.
(2) The Basic Architecture of an Adaptive Mobile Manipulation Apparatus
Please refer to
(3) Teaching-Based Adaptive Manipulation
(3.1) Motion Plan
Following the paragraphs described above, the manipulation task is defined as using the manipulator to manipulate objects without direct physical contact by the human labor after the adaptive mobile manipulation apparatus has reached the pose for manipulation. The manipulation task includes a series of manipulation actions and can be defined as a “motion plan”. Please refer to
Without loss of generality, only the actions listed below in the Table I are considered in the present disclosure. The possible actions in a motion plan are classified by pose-awareness. The actions which are classified as pose-aware are those directly related to the manipulation of the target object after the manipulator and the EFF are within a range capable of reaching the target object. Note that it is possible to have actions with online adjustment using wrist camera or other sensors, which can be combined with this framework.
(3.2) Creating a Base-Case Motion Plan Using “Teaching” and its Implementation
Please refer to
However, if a motion plan for such manipulation task for a specific mobile manipulation pose configuration is provided, this motion plan is able to be modified for other different mobile manipulation pose configurations. This motion plan is defined as the “base-case motion plan” and the purpose of “teaching” is to create it. A base-case motion plan can be designed manually, computationally, or through teaching. This section covers the process to create the base-case motion plan using teaching and its implementation.
(3.2.1) Environmental Setup
In order to apply position and orientation fixes to adjust actions in the base-case motion plan, additional information needs to be captured for the base-case motion plan. In the present disclosure, each pose-aware action is associated with a pair of fiducial markers. Hence, as shown in
(3.2.2) Base-Case Motion Plan and its Digital Representation
Given an arbitrary mobile manipulator pose, a motion plan to perform the manipulation task can be modified into a base-case motion plan by adding marker information to each pose-aware action. Hence, to create a base-case motion plan, an additional step is required to detect the pair of markers to obtain base-case marker information for being associated with each pose-aware action. This can be done by using squared fiducial marker technique. Such technique can provide a stream of estimated poses (position and orientation) of a marker using the image frames come from an RGB camera. However, the values could be fluctuating, and a filter that can be applied to the pose stream is presented in a latter section and is designed to improve the detections. The following Table II shows the data structures used in actions and how different types of variables are represented in a computer system.
Base on the paragraphs described above, the motion plan defined earlier for a base-case motion plan can be extended as an ordered list of actions with positions of a pair of square fiducial markers stored if the action is pose-aware. The details of the data structures used for a base-case motion plan in a computer system are illustrated below:
Base case motion plan=[a]
a=PoseEFF|Trajectory|Offsettarget|JS|Offsetmanipulator|Actionother
PoseEFF=((p,q), (pL, pR))
Trajectory=([(p,q)], (pL, pR))
Offsettarget=((x,y,z),(pL,pR))
JS=[j]
Offsetmanipulator=(x,y,z)
p,pL,pR=(x,y,z)
q=(rx, ry, rz, rw)
x,y,z,rx,ry,rz,rw,j are real numbers.
The notations used in former paragraphs are as follows:
[u]: an ordered list of “u”
a: an action
|: or
PoseEFF: EFF Pose
Trajectory: EFF Trajectory
Offsettarget: EFF movement offset along target object's coordinate
JS: manipulator's joint state
Offsetmanipulator: EFF movement offset along manipulator's coordinate
Actionother: other actions that does not affect the manipulator state
p,pL,pR: position, position of left marker, position of right marker
q: orientation (Euler angles or quaternion)
Note that without loss of generality, it is assumed the origin (0, 0, 0) and the world coordinate system are aligned with the base of the manipulator.
(3.2.3) Create Base-Case Motion Plan Using Teaching
When doing manipulator onboard programming, teaching is used to specify a state (mostly a joint state) by moving the manipulator to a desired configuration instead of giving the values of this joint state. In the present disclosure, this concept is extended further to the entire motion plan and the user guides the manipulator through a series of actions during this process.
Please refer to
If the action is a pose-aware action, corresponding variables from Table II are collected in step 604 and stored into “curr_action” along with the type of the action. In this step, these variables can be collected directly from the manipulator after the user operates the manipulator to the desired pose. Next step 605 is to collect the base-case marker information from left and right markers using known technique described earlier and the base-case marker information is stored in “curr_action” along with marker IDs, in which marker IDs are given by the user. An algorithm that collects a series of samples and applies a filter to filter out extreme value for providing better values is presented in Section 4.1. Then, “curr_action” is appended to the end of “actions” in step 606. Accordingly, the pose-aware actions associated with the base-case marker information for the base-case motion plan are created and defined as first pose-aware actions.
Similarly, if the action is a non-pose-aware action, corresponding variables from Table II is collected in step 607 and stored into “curr_action” along with the type of the action. In this step, these variables can be collected (1) directly from the manipulator after the user operates the manipulator to the desired joint state, and (2) through user keyboard input (for example, EFF position offset, close/open the gripper, or other options). The system performs corresponding action upon receiving user input, then “curr_action” is appended to the end of “actions” in step 608. Accordingly, the non-pose-aware actions for the base-case motion plan are created and defined as first non-pose-aware actions.
If it indicates that the user has finished creating the base-case motion plan, “actions” is then flattened as a string data structure and stored with a unique name specified by the user for later use in step 609. Then the process moves to the finish state 610.
(3.3) Adjusting Base-Case Motion Plan for Run Time Scenario
Please refer to
That is, the run time motion plan is modified from the base-case motion plan. First, according to the run time marker information obtained by the sensor, the first pose-aware actions of the base-case motion plan are modified into different pose-aware actions, which are defined as second pose-aware actions in the run time motion plan. Further, the first non-pose-aware actions of the base-case motion plan are not modified and executed directly at run time, which are defined as second non-pose-aware actions in the run time motion plan.
Accordingly, in summary, the process for the adaptive mobile manipulation apparatus to manipulate the target object is as shown in
(4) Algorithms for Modifying Base-Case Motion Plan
(4.1) Filtering Algorithm for Getting Stable Marker Positions
Please refer to
The filtering algorithm in step 905 is presented below.
The notations used in former paragraphs are as follows:
M: the set of markers to be localized
m: marker m
Pm: the k samples for a specific marker m
pmi: i-th sample in Pm with position (xmi, ymi, zmi)
Cm: final position for marker m
Other notations for temporal variables are self-explanatory
The output from the filtering algorithm is then used to modify the base-case motion plan.
Note that by placing 3 markers in an L shape (or more markers), it will be able to determine position offset in 3-dimension as well as Roll/Pitch/Yaw. Above information can then be used to handle the case that the height of the shelf is changed. The motion plan modification is similar.
(4.2) Algorithm for Getting Position and Orientation Offsets Between Base-Case Motion Plan and Run Time Motion Plan
In the base-case motion plan, the base-case marker information is associated with each action. This along with the run time marker information detected at run time is used to calculate the position offset and orientation offset and to be applied to modify the base-case motion plan, in which the first pose-aware actions are accordingly modified into the second pose-aware actions. Please refer to
Please refer to
Input:
a=(xa,ya,za), b=(xb,yb,zb), c=(xc,yc,zc), d=(xd,yd,zd)
Position Offset:
(Δx,Δy,0)=(xc−xa,yc−ya,0)
Orientation Offset:
where:
r=(xr,yr)=(xb−xa,yb−ya)
s=(xs,ys)=(xd−xc,yd−yc)
r·s=x
r
x
s
+y
r
y
s
|r|·|s|=√{square root over (xr2+yr2)}+√{square root over (xs2+ys2)}
(4.3) Algorithms for Modifying Base-Case Motion Plan
With position offset (Δx, Δy, 0) and orientation offset thetaz, the base-case motion plan can now be adjusted into the run time motion plan for performing the manipulation. Only pose-aware actions in the motion plan are required to be modified, at least including “moving EFF to a pose”, “EFF traversing through a trajectory” and “EFF moving with position offset with respect to target's coordinate” (refer to Table I). The calculations of adjustment are described in Sections 4.3.1 and 4.3.2.
(4.3.1) EFF Pose and Trajectory
For action type “moving EFF to a pose”, a single EFF pose needs to be modified. On the other hand, “EFF traversing through a trajectory” action contains a series of EFF poses and each needs to be recalculated. Hence both can use the same algorithm to calculate new target EFF pose as presented below.
Input:
EFF Pose in Base-Case Motion Plan
pose(p,q),p=(x,y,z),q=(qx,qy,qz,qw)
Marker Information in Base-Case Motion Plan
l=(xl,yl,zl), r=(xr,yr,zr)
Algorithm:
p′=(xp′,yp′,zp)=(x−xa,y−ya,z)
p″=(xp′cos θ−yp′sin θ,xp′,sin θ+yp′cos θ,zp)=(xp″,yp″,zp)
p
n=(xp″+xa+Δx,yp″+ya+Δy,zp)
q
n
=q
r
×q
Output:
(4.3.2) Movement Position Offset
Action type “EFF moving with position offset with respect to target's coordinate” can be calculated using the equations calculating of new EFF movement offset as following.
Input:
EFF Movement in Base-Case Motion Plan
Δ=(Δx,Δy,Δz)
Rotation (about Z axis) θ=thetaz
Algorithm:
New EFF movement Δ′=(Δ′x,Δ′y,Δ′z)
where: Δ′x=Δx cos θ−Δy sin θ
Δ′y=Δx sin θ+Δy cos θ
Δ′z=Δz
To summarize, this framework provides a process to create a base-case motion plan according to the base-case marker information. Then, with the run time marker information obtained through the method using two squared fiducial markers provided in the present disclosure, the base-case motion plan can be adjusted into the run time motion plan using the methods provided to compensate both position and orientation offsets.
In brief, the present disclosure has the following advantages:
1. Low cost: the cost for setup the system is affordable, which includes RGB camera and cost to print the markers.
2. Easy deployment: markers can be easily deployed to the field within the view of camera, and there is no requirement on measurement and alignment.
3. Accuracy: multi-marker system from this disclosure provides good accuracy on finding position offset and orientation offset with respect to the base-case motion plan.
4. A method (“teaching”) for creating the base-case motion plan makes it realistic to be adopted in the industrial without a research team.
5. Local information for manipulation: only local information for manipulation is used and stored in this disclosure, which is relatively cheaper than constructing an accurate global 3D environmental map and makes re-arrangement of the environmental settings easy.
From the above descriptions, the present disclosure provides an adaptive mobile manipulation method which classifies the actions for object manipulation into pose-aware actions and non-pose-aware actions and further associates the pose-aware actions with localization information obtained by detecting the markers, and thus, the pose-aware actions with high accuracy can be achieved through a low-cost framework of an adaptive manipulation apparatus.
While the invention has been described in terms of what is presently considered to be the most practical and preferred embodiments, it is to be understood that the invention needs not be limited to the disclosed embodiment. On the contrary, it is intended to cover various modifications and similar arrangements included within the spirit and scope of the appended claims which are to be accorded with the broadest interpretation so as to encompass all such modifications and similar structures.
This application claims the benefit of U.S. Provisional Application Ser. No. 63/217,109 filed on Jun. 30, 2021, the disclosure of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
63217109 | Jun 2021 | US |