Before performing any steps on this page, make sure you have reviewed Identify_Ceph_Latency_Issues.
IMPORTANT: This procedure mixes commands that must be run on the host(s) running the MDS daemon(s) with commands that can be run from any of the ceph-mon nodes.
NOTICE: These steps are based on the upstream documentation, which can be viewed at https://docs.ceph.com/en/octopus/cephfs/troubleshooting/.
Make sure you are viewing the documentation version that matches the Ceph release your cluster is running.
If you are here, then we are going to assume the following:
Procedure:
Identify the active MDS
ncn-s00(1/2/3):~ # ceph fs status -f json-pretty | jq -r '.mdsmap[] | select(.state == "active") | .name'
cephfs.ncn-s003.ihwkop
SSH to the host running the active MDS
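The host for this step can usually be read out of the daemon name returned in the previous step. The following is a minimal sketch, assuming the name follows the `<fs_name>.<host>.<random_suffix>` pattern seen in the example output (`cephfs.ncn-s003.ihwkop`) and that the file system name contains no dots. The function name `mds_host_from_name` is hypothetical.

```shell
# Hypothetical helper: derive the host from a cephadm-style MDS daemon name.
# Assumes the form <fs_name>.<host>.<random_suffix>, as in the example output;
# a file system name containing dots would break this assumption.
mds_host_from_name() {
    name="$1"
    host="${name#*.}"    # drop the leading "<fs_name>."
    host="${host%.*}"    # drop the trailing ".<random_suffix>"
    printf '%s\n' "$host"
}

# Example using the daemon name from the output above.
mds_host_from_name "cephfs.ncn-s003.ihwkop"
```

The result can be passed straight to ssh, for example `ssh "$(mds_host_from_name cephfs.ncn-s003.ihwkop)"`.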
Enter a cephadm shell
ncn-s003:~ # cephadm shell
Inferring fsid 7350865a-0b21-11ec-b9fa-fa163e06c459
Inferring config /var/lib/ceph/7350865a-0b21-11ec-b9fa-fa163e06c459/mon.ncn-s003/config
Using recent ceph image arti.dev.cray.com/third-party-docker-stable-local/ceph/ceph@sha256:70536e31b29a4241999ec4fd13d93e5860a5ffdc5467911e57e6bf04dfe68337
[ceph: root@ncn-s003 /]#
NOTE: You may see messages such as "WARNING: The same type, major and minor should not be used for multiple devices." These can be ignored; there is an upstream bug to address this.
Dump the in-flight ops
[ceph: root@ncn-s003 /]# ceph daemon mds.cephfs.ncn-s003.ihwkop dump_ops_in_flight
{
    "ops": [],
    "num_ops": 0
}
NOTE: The example above only demonstrates how to run the command. Recreating the exact scenario needed for a full example is not easy; this page will be updated when that information is available.
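When the dump is not empty, it can help to narrow it down to the oldest operations. The following is a minimal sketch, assuming `jq` is available wherever the output is processed and that each entry in `ops` carries `age`, `description`, and `type_data.flag_point` fields, as in typical upstream output. The function name `stuck_ops` and the 30-second default threshold are hypothetical.

```shell
# Hypothetical helper: read `dump_ops_in_flight` JSON on stdin and print
# every op at least MIN_AGE seconds old as: age <TAB> flag_point <TAB> description.
# Field names (.age, .type_data.flag_point, .description) are assumed from
# typical upstream output; verify them against your Ceph release.
stuck_ops() {
    min_age="${1:-30}"    # threshold in seconds; 30 is an arbitrary default
    jq -r --argjson min "$min_age" '
        .ops[]
        | select(.age >= $min)
        | [.age, (.type_data.flag_point // "unknown"), .description]
        | @tsv
    '
}

# Usage (inside the cephadm shell, or against a saved copy of the dump):
#   ceph daemon mds.cephfs.ncn-s003.ihwkop dump_ops_in_flight | stuck_ops 30
```

An empty result with a non-zero `num_ops` would only mean that no op has reached the threshold yet; lower the argument to see younger ops.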
General Steps from upstream: