MaaS Notes Installation LXD based maas is so far the best solution. Follow official guide lxc install maas and mass installation. Few steps to install: Create dedicated lxd env for maas, including network and storage pool. maas init to create admin user. Login https://{MAAS}:5240/MAAS Setup user public key injection for bare metal commissioning. Commision nodes and setup networks. Deploy. Storage Preparation Volume can be ZFS/LVM/btrfs: Create lvm pool.

Continue reading

OVS traffic capture OVS traffic flow illustration(kolla example): traffic to go out of cloud via provider network VM –> tap+qbr(linuxbridge)+qvb –> qvo+br-int+int-br-ex –> phy-br-ex+br-ex+br_vlan –> external network traffic to go to vxlan tenant VM –> tap+qbr(linuxbridge)+qvb –> qvo+br-int+patch-tun –> patch-int+br-tun+port vxlan# –> remote host vxlan if ip if no DVR used, then all traffic will go to neutron nodes from compute nodes then use neutron nodes’ port# to go out.

Continue reading

To enable onboard Horizontal Autoscaling feature, a Metric Server needs to be installed first for k8s to pull resource data from. helm install stable/metrics-server -n metric --namespace kube-system -f metric.yml Metric Server has a chart on Helm stable, but somehow new version of it behaves weirdly, it shows error as: unable to fetch pod metrics for pod rook-ceph/csi-rbdplugin-qv94k: no metrics known for pod When this happens, it means you are facing some TLS and network issues.

Continue reading

How to - Ceph - Identify the server drive bay number of a faulty drive To identify a faulty disk is in which drive bay: Method 1 - Using iLO and iDRAC Login to the iLO or iDRAC interface Check for error messages in the iLO or iDRAC. If iLO (HP), from the main page, go to Information → System Information → Storage → Physical View

Continue reading

How To Replace Ceph Osd

How to - Ceph - Configure Ceph on a new drive source: https://ceph.com/geen-categorie/admin-guide-replacing-a-failed-disk-in-a-ceph-cluster/ Remove the OSD of the faulty drive If you are replacing a faulty drive with a new one, you will need to remove the OSD of the faulty drive before proceeding with creating the new OSD. *Requirement: The faulty SSD must have been replaced with a healthy SSD. Login to the Ceph node with the faulty drive.

Continue reading

How to properly remove a node from cluster Find your node then drain it to let k8s reschedule pods and avoid future schedule on this node: kubectl drain <node-name> --ignore-daemonsets --delete-local-data Then you'll fine node.kubernetes.io/unschedulable=NoSchedule label on this node. Delete node from cluster. kubectl delete node <node-name> Then everything k8s related will be removed, and you'll only see this left on node: t login: Fri Dec 6 05:25:27 2019 from 10.

Continue reading

Rook CRD

Rook is a Cloud Native Storage solution, it creates CRDs which in turn create their corresponding storage pods and resources. Install Rook CRD Install Operator via helm chart. This is the foundation of all fun. helm repo add rook-release https://charts.rook.io/release helm install --namespace rook-ceph rook-release/rook-ceph -n rook Note: Rook Operator and CRD cluster must be in the same namespace, because CRD will use helm created serviceaccount to create all resources.

Continue reading

Mariadb in a Galera Cluster Maintenance and Recovery¶ Introduction¶ This document covers how to perform system maintenance with the Mariadb database in active production, and how to recover from power outages, or network failure. Environment¶ SCM (Scyld Cloud Manager) currently leverages the Kolla OpenStack project which packages the OpenStack services into Docker containers. The mariadb database utilizes galera to run a database cluster on the three OpenStack controller systems. The cluster provides high availability as well as scalability.

Continue reading

Terraform Hints

Terraform now provides new functions, making itself more powerful as a configuration mgmt tool. Here are some memos: To make var value join with fixed string: "test-${each.key}" cidrsubnet From Terraform official documents, they have an example as following and explained in none humane language way. So I'll try to interpret it from a Network Expert perspective. cidrsubnet(prefix, newbits, netnum) And an example below: > cidrsubnet("172.16.0.0/12", 4, 2) 172.18.0.0/16 > cidrsubnet("10.1.2.0/24", 4, 15) 10.

Continue reading

RedHat Certified System Engineer(EX300) is an advanced and extended exam from EX200. It tests what EX200 already covers and extend them in a more detailed and advanced level. Network Manager Besides what we already know about using nmcli to configure networks, RHCE would like to see if candidates would understand how nmcli and legacy script based config work. A table of Comparison of nm-settings and ifcfg-* Directives nmcli con mod ifcfg-* file Effect ipv4.

Continue reading

Author's picture

LuLU

Love coding and new technologies

Cloud Solution Consultant

Canada